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Quantum Heisenberg models and their probabilistic 
representations 

Christina Goldschmidt, Daniel Ueltschi, and Peter Windridge 

Abstract. These notes give a mathematical introduction to two seemingly 
unrelated topics: (i) quantum spin systems and their cycle and loop representa- 
tions, due to Toth and Aizenman-Nachtergaele; (ii) coagulation-fragmentation 
stochastic processes. These topics are nonetheless related, as we argue that 
the lengths of cycles and loops effectively perform a coagulation-fragmentation 
process. This suggests that their joint distribution is Poisson-Dirichlet. These 
ideas are far from being proved, but they are backed by several rigorous results, 
notably of Dyson-Lieb-Simon and Schramm. 
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1. Introduction 

We review cycle and loop models that arise from quantum Heisenberg spin 
systems. The loops and cycles are geometric objects defined on graphs. The main 
goal is to understand properties such as their length in large graphs. 

The cycle model was introduced by Toth as a probabilistic representation of 
the Heisenberg ferromagnet [48], while the loop model is due to Aizenman and 
Nachtergaele and is related to the Heisenberg antiferromagnet pQ . Both models are 
built on the random stirring process of Harris |29j and have an additional geometric 
weight of the form $# c y cles or $# 1oo p s with parameter § = 2. Recently, Schramm 
studied the cycle model on the complete graph and with $ = 1 (that is, without 
this factor) [45] , He showed in particular that cycle lengths are generated by a 
split-merge process (or "coagulation-fragmentation" process), and that the cycle 
lengths have Poisson-Dirichlet distribution with parameter 1. 

The graphs of physical relevance are regular lattices such as 7L d (or large fi- 
nite boxes in Z d ), and the factor 2# ob j° cts needs to be present. What should we 
expect in this case? A few hints come from the models of spatial random per- 
mutations, which also involve one-dimensional objects living in higher dimensional 
spaces. The average length of the longest cycle in lattice permutations was com- 
puted numerically in |26j . In retrospect, this suggests that the cycle lengths have 
the Poisson-Dirichlet distribution. In the "annealed" model where positions are 
averaged, this was proved in [9]; the mechanisms at work there (i.e., Bose- Einstein 
condensation and non-spatial random permutations with Ewens distribution), how- 
ever, seem very specific. 

We study the cycle and loop models in Z d with the help of a stochastic process 
whose invariant measure is identical to the original measure with weight $# c y clcs or 
$ #loops , and which leads to an effective split-merge process for the cycle (or loop) 
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lengths. The rates at which the splits and the merges take place depends on This 
allows us to identify the invariant measure, which turns out to be Poisson-Dirichlet 
with parameter While we cannot make these ideas mathematically rigorous, 
they are compatible with existing results. 

As mentioned above, cycle and loop models are closely related to Heisenberg 
models. In particular, the cycle and loop geometry is reflected in some important 
quantum observables. These observables have been the focus of intense study by 
mathematical and condensed matter physicists, who have used imagination and 
clever observations to obtain remarkable results in the last few decades. Most 
relevant to us is the theorem of Mermin and Wagner about the absence of magnetic 
order in one and two dimensions [38 , and the theorem of Dyson, Lieb, and Simon, 
about the existence of magnetic order in the antiferromagnetic model in dimensions 
3 and more [17] . We review these results and explain their implications for cycle 
and loop models. 

Many a mathematician is disoriented when wandering in the realm of quantum 
spin systems. The landscape of 2 x 2 matrices and finite-dimensional Hilbert spaces 
looks safe and easy. Yet, the proofs of many innocent statements are elusive, and one 
feels quickly lost. It has seemed to us a useful task to provide a detailed introduction 
to the Heisenberg models in both their quantum and statistical mechanical aspects. 
We require various concepts from stochastic process theory, and will need to describe 
carefully the split-merge mechanisms and the Poisson-Dirichlet distribution. The 
last two are little known outside of probability and are not readily accessible to 
mathematical physicists and analysts, since the language and the perspective of 
those domains are quite different (see e.g. the dictionary of 1 2 1 j . p. 314, between 
analysts' language and probabilists' "dialect"). In these notes, we have attempted 
to introduce these different notions in a self-contained fashion. 

1.1. Guide to notation. The following objects play a central role. 



A=(V,£) 

qU) q 
^X , ^>X 

(•}A,/9,ft 

Za, Fa 
PA,p(duj) 



7 

■d > 

A„ = (V„,£„) 

m 

<7oo i ?7macro 

PD e 

(£*,*> 0) 



A finite graph with undirected edges. 
Spin operator s (j|2 .2| 



Gibbs state ([2.4) 



Partition function and free energy ([2.4). 



Probability measure for Poisson point processes on [0, /3] (/? > 
0) attached to each edge of A (defined in 



3.1) 



Cycle and loop configurations constructed from edges in ui 



(Q. 

Cycle in C(lu) or loop in L(uj). 

Geometric weight involving the number of cycles a nd lo ops. 
Box {1, . . . , n} d in 7L d with nearest-neigbo r ed ges ([ 4.1 ). 
Various definitions of the magnetization ([ 4.2 ). 
Antiferromagnetic long-range order ([4.3). 



Fractions of vertices in infinite or macroscopic cycles/loops 



([4.4) 



Countable partitions of [0, 1] with parts in decreasing order, 
i.e. {pi>P2>...>0:Eift = l}' 



Poisson-Dirichlet distribution with parameter 9 on Ai ([7.2). 
Stochastic process with invariant measure given by our cycle 



and loop models ([8.2) 
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2. Hilbert space, spin operators, Heisenberg Hamiltonian 

We review the setting for quantum lattice spin systems described by Heisenberg 
models. Spin systems are relevant for the study of electronic properties of condensed 
matter. Atoms form a regular lattice and they host localized electrons, which 
are characterized only by their spin. Interactions are restricted to neighboring 
spins. One is interested in equilibrium properties of large systems. There are 
two closely related quantum Heisenberg models, which describe ferromagnets and 
antiferromagnets, respectively. The material is standard and the interested reader 
is encouraged to look in the references |44L 1461 1391 119) for further information. 



2.1. Graphs and Hilbert space. Let A = (V, £) be a graph, where V is a 
finite set of vertices and £ is the set of "edges" , i.e. unordered pairs in V x V. From 
a physical perspective, relevant graphs are regular graphs such as Z d (or a finite box 
in Z d ) with nearest-neighbor edges, but it is mathematically advantageous to allow 
for more general graphs. We restrict ourselves to spin-| systems, mainly because 
the stochastic representations only work in this case. 

To each site x € V is associated a 2-dimensional Hilbert space T~L X = C 2 . It is 
convenient to use Dirac's "bra", (-|, and "ket", |-), notation, in which we identify 



(2.1) 



The notation (f\g) means the inner product; we use the convention that it is linear 
in the second variable (and antilinear in the first). Occasionally, we also write 
(f\A\g) for (f\Ag). The Hilbert space of a quantum spin system on A is the tensor 
product 

W (V) =(g)?4, (2.2) 

which is the 2' v ' dimensional space spanned by elements of the form ® x ^vfx with 
fx € H x - The inner product between two such vectors is defined by 



(2.3) 



The inner product above extends by (anti)linearity to the other vectors, which are 
all linear comb inati ons of vectors of the form ® X £.vfx- 



The basis (2.1| of C 2 has a natural extension in Tl^; namely, given = 
(s x ) xe v with s x = ±|, let 

l* (v) > = ®k»>- 

xeA 

These elements are orthonormal, i.e. 

(aM\sP») = H(s x \s x ) = J] ^., s „ (2.5) 
xev xev 



where S is Kronecker's symbol, 5 a b = 1 if a = b, otherwise. 
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2.2. Spin operators. In the quantum world, physically relevant quantities 
are called observables and they are represented by self-adjoint operators. The op- 
erators for the observable properties of our spin-^ particles are called the Pauli 
matrices, defined by 



i or 2 v i o ) ' 2 vo -i 



(2.6) 



We interpret as the spin component in the i th direction. The matrices are 
clearly Hermitian and satisfy the relations 

[SW,sW]=iSW, [SW,sW] = iSW, [SW,S<»] =LS (2) . (2.7) 

These operators have natural extensions as spin operators in T-L^ v \ Let ieV, 
and write — H x ® %( V \W). We define operators S x ^ indexed by x £ V by 

S« =5 W ® Id V \ {x} . (2.8) 

I I f£\ 

The commutation relations (2.7 1 extend to the operators S x , namely 

[SW.^J^i^, (2,) 

and all other relations obtained by cyclic permutations of (123). Indeed, it is not 
hard to check that the matrix elements (s^ | • \s^) of both sides are identical for all 
S (V) g {— |, \} v ■ It is customary to introduce the notation S x = (Sx , S x 2 \ S x 3 ' ') , 
and 

' — — S x ^ Sy ^ ~\~ ^ Sy ^ ~\~ ^ Sy ^ . (2.10) 

Note that operators of the form SxSy, with i/j, act in %W) = % x ® % y ® 
-U(v\{x, v }) as follows 

SjpS® = ® S<« ® Id v \ { * )1/} . (2.1D 
In the case x — y, and using (S x ^) 2 = |ldy, we get 

^ = (4 1 )) 2 + (^)) 2 + (^)) 2 =|ld V . (2,2, 

Lemma 2.1. Consider S x ■ S y in H x ® H y . It is self-adjoint, and its eigenvalues 
and eigenvectors are as follows: 

• — | is an eigenvalue with multiplicity 1; the eigenvector is (| g ,— |) — 

|->*»; 

• j is an eigenvalue with multiplicity 3; three orthonormal eigenvectors are 

\- -) I - 1 -M -Mil -i\ + 1 - i I\l 

I 2 ' 2 / ' I 2' 2/' v^Vl2' 2/ ' I 2'2//- 

The eigenvector corresponding to — | is called a "singlet state" by physicists, 
while the eigenvectors for j are called "triplet states" . 

Proof. We have for all a, b = ±|, 

SPsM\a,b) = l\-a,-b), 

S x 2) S y 2) \a,b) = -<z&| - a, -6), (2.13) 
S( 3) 4 3) |M) =ah|a,6). 
The lemma then follows from straightforward linear algebra. □ 
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2.3. Hamiltonians and magnetization. We can now introduce the Heisen- 
berg Hamiltonians, which are self-adjoint operators in T-L^ v \ 



rjfcrro 
H A,h 



Tj ant i 
H AJi = 



E 

{x,y}e£ 

E StC ' Sy 

{x,y}££ 



/>E^> 

x£V 



^E^ 3) - 



(2.14) 



Let us briefly discuss the physical motivation behind these operators. One is in- 
terested in describing a condensed matter system where atoms are arranged on a 
regular lattice. Each atom hosts exactly one relevant electron. Each electron stays 
on its atom and its spin is described by a vector in the Hilbert space C 2 . A system 
of two spins is described by a vector in C 2 <£> C 2 . The singlet and triplet states 
of Lemma |2.1| are invariant under rotation of the spins and they form a basis. In 
absence of external magnetic field, the energy operator should be diagonal with 
respect to these states, and there should be one eigenvalue for the singlet, and one 
other eigenvalue for the triplets. Up to constants, it should then be ±S X ■ S y . It 
is natural to define the total energy as the sum of nearest-neighbor interactions. 
Taking into account the contribution of the external magnetic field, which can be 
justified along similar lines, we get the Hamiltonians of (2.14 1. 

Next, let M A be the operator that represents the magnetization in the 3rd 
direction. 



M 



(3) 



E^ 3) - 



(2.15) 



xev 



Lemma 2.2. The Hamiltonian and magnetization operators commute, i.e. 

[H A>h ,M A }=0. 



Proof. This follows from the commutation relations (2.9). Namely, using the 
commute for x =/= y, 

[S X ■ Sy, S^] 



fact that and 5, (3) 



[H A , h ,M A ] = 



y 

E 

{x,y}£S,z£V 



E (EW> + FW } > ^ + [S?>S&>,Sl*>] 

{x,y}e£ 



[0(2)0(2) c(3)l 



The first commutator is 

[<?(!) <?( x ) S" (3) l - [ 

[O x D y ,D X J — I 

and the others are similar. We get 
[H Aih ,M A ]=i (-4 2) S^ 

{x,y}££ 



O(l) 0(3)10(1) 

KJ X , kJ x \^y 



.i,o(2) o(l) 
^ x y 



O(l) o(2) 

x 



. o(l) 0(2) 
^ x ^y 



O(2)o(l) 



0. 



(2.16) 



(2.17) 



(2.18) 



□ 



2.4. Gibbs states and free energy. The equilibrium states of quantum 
statistical mechanics are given by Gibbs states (-) A ,/3,h- These are nonnegative 
linear functionals on the space of operators in of the form 
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where the normalization 

Z K {P,h) = Tr e - pH ^ h (2.20) 

is called the partition function. Here, Tr denotes the usual matrix trace. 

There are deep reasons why the Gibbs states describe equilibrium states but 
we will not dwell on them here. We now introduce the free energy F A ((3,h). Its 
physical motivation is that it provides a connection to thermodynamics. It is a kind 
of generating function and it is therefore mathematically useful. The definition of 
the free energy in our case is 

F A (p,h) = -^logZ A (P,h). (2.2i) 

Lemma 2.3. The function (3F A (/3,h) is concave in (/3,f3h). 

PROOF. We will rather check that —f3F A is convex, which is the case if the 
matrix 

/ 3 2 /3F A 9 2 /3F A \ 
I 9/3 2 d0d(0h) \ 
I d 2 /3F A g> 2 /3F a I 
\dpd(f3h) d(0h) 2 / 

is positive definite. Let us write (•) instead of (•} A> j h- We have 
^L(3F A (p,h) = -{(H K0 -(H Afl )) 2 ), 

f3F A (l3,h) = -{(M A -(M A )) 2 ), (2.22) 



d(/3h) 2 
d 2 



-8F A (f3, h) = ({H Afi - (H A . ))(M A - (M A ))). 



dj3d(j3h)' 
Then F A is convex if 

({H Aj0 - (H Afi ))(M A - (Af A ))) 2 < {(H Afi - (Ha, )) 2 )((M a - (Ma)) 2 ). (2.23) 

It is not hard to check that the map (A, B) n- (A* B) is an inner product on the 
space of operators that commute with H Ai h. Then 

\(A*B)\ 2 < (A*A)(B*B) (2.24) 



by the Cauchy-Schwarz inequality and, in particular, this implies (2.23|. □ 

Concave functions are necessarily continuous. But it is useful to establish that 
F A ((3,h) is uniformly continuous on compact domains. This property will be used 
in Section [4.11 which discusses the existence of infinite volume limits. 



Lemma 2.4. 

|/?F A (/3, h) - P'F A (P',h')\ < 1/3- p\{\\e\ + |V|) + lP\h - h'||V|. 
Proof. We have 

f P d rP 
pF A (p,h)-p'F A {p',h)= —sF A {s,h)<l s = (H A<h ) A>Sth ds. (2.25) 

J0' ds J , 

We can also check that (3F A (f3, h)-(3F A (f3, ti) = (M A ) A ,/3, s ds. The result follows 
from \{A} A: j3^\ < \\A\\ for any operator A, and from \\S X ■ S y \\ = | (cf Lemma 2.1) 
and ||^ 3) || = i. □ 
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2.5. Symmetries. In quantum statistical mechanics, a symmetry is repre- 
sented by a unitary transformation which leaves the Hamiltonian invariant. It 
follows that (finite volume) Gibbs states also possess the symmetry. However, 
infinite volume states may lose it. This is called symmetry breaking and is a man- 
ifestation of a phase transition. We only mention the "spin flip" symmetry here, 
corresponding to the unitary operator 

Uls^) = \~S iV) ). (2.26) 

One can check that U^S^S^U = S ( y l) and U^S^U = -Si 3) . It follows 
that 

U^H^.hU = H Al - h , (2.27) 

This applies to both the ferromagnetic and antiferromagnetic Hamiltonians. It 
follows that F\(f3, —ft) = F\(j3, ft), and so the free energy is symmetric as a function 
of ft. 

3. Stochastic representations 

Stochastic representations of quantum lattice models go back to Ginibre, who 
used a Peierls contour argument to prove the occurrence of phase transitions in 
anisotropic models [27 . Conlon and Solovej introduced a random walk represen- 
tation for the ferromagnetic model and used it to get an upper bound on the free 
energy [13j . A different representation was introduced by Toth, who improved 
the previous bound |48) . Further work on quantum models using similar repre- 
sentations include the quantum Pirogov-Sinai theory [111 115] and Ising models in 
transverse magnetic field [301 114L I28j . 

A major advantage of Toth's representation is that spin correlations have natu- 
ral probabilistic expressions, being given by the probability that two sites belong to 
the same cycle (see below for details). A similar representation was introduced by 
Aizenman and Nachtergaele for the antiferromagnetic model, who used it to study 
properties of spin chains pQ. The random objects are a bit different (loops instead 
of cycles), but this representation shares the advantage that spin correlations are 
given by the probability of belonging to the same loop. 

The representations due to Toth and Aizenman-Nachtergaele both involve a 
Poisson process on the edges of the graph. The measure is reweighted by a function 
of suitable geometric objects ("cycles" or "loops"). We first describe the two models 



in Section |3.1| we will relate them to the Heisenberg models in Sections |3.3| and 

El 



3.1. Poisson edge process, cycles and loops. Recall that A = (V, £) is 
a finite undirected graph. We attach to each edge a Poisson process on [0, (3} of 



unit intensity (see ^7.2.1 for the definition of a Poisson point process). The Poisson 
processes for different edges are independent. A realization of this "Poisson edge 
process" is a finite sequence of pairs 

ui = ((ei,ii), . . . , (e k ,t k )). (3.i) 

Each pair is called a bridge. The number of bridges across each edge, thus, has 
a Poisson distribution with mean /3, and the total number of bridges is Poisson 
with mean (3\£\. Conditional on there being k bridges, their times of arrival are 
uniformly distributed in {0 < t\ < t 2 < ■ ■ ■ < t k < j3} and the edges are chosen 
uniformly from £ . The corresponding measure is denoted p A p(du). 
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To each realization u) there corresponds a configuration of cycles and configura- 
tion of loops. The mathematical definitions are a bit cumbersome but the geometric 
ideas are simpler and more elegant. The reader is encouraged to look at Figure [T] 
for an illustration. 



cycles y \_ loops 




A B A B A B A 

Figure 1. Top: an edge Poisson configuration uj on V x [0,/3] per - 
Bottom left: its associated cycle configuration. Bottom right: its 
associated loop configuration. We see that |C(u;)| = 3 and |£(w)| = 
5. 



We consider the cylinder V x [0,/?] per , where the subscript "per" indicates 
that we consider periodic boundary conditions. A cycle is a closed trajectory on 
this space; that is, it is a function 7 : [0, L] — > V x [0, /3] por such that, if 7(7-) = 
(x(t), i(r)), we have: 

• 7(7") is piecewise continuous; if it is continuous on the interval / C [0, L], 
then x(t) is constant and £pt(r) = 1 in /. 

• 7(7") is discontinuous at r iff the pair (e,t) belongs to w, where t = t(r) 
and e is the edge {x(t— ), x(t+)}. 

We choose L to be the smallest positive number such that 'f(L) — 7(0). Then L 
is the length of the cycle; it corresponds to the sum of the vertical legs in Figure 
[T] and is necessarily a multiple of (3. Let us make the cycles semi-continuous by 
assigning the value 7(7") = 7(1"—) at the points of discontinuity. We identify cycles 
whose support is identical. Then to each ui corresponds a configuration of cycles 
C(lj) whose supports form a partition of the cylinder V x [0, /3] pcr . The number of 
cycles is |C(w)|. 

Loops are similar, but we now suppose that the graph is bipartite. The A 
sublattice possesses and orientation, which is reversed on the B sublattice. We still 
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consider the cylinder V x [0,/3] per - A loop is a closed trajectory on this space; that 
is, it is a function 7 : [0,L] — >• V x [0,/3] per such that, with 7(7-) = (x(r),f(r)): 

• 7(7") is piecewise continuous; if it is continuous in interval / C [0, L], then 
x(t) is constant and, in /, 

d , . ( 1 if x(t) belongs to the A sublattice, 

— t(T) = < (3.2) 

(XT 1—1 if x(t) belongs to the B sublattice. 

• 7(r) is discontinuous at r iff the pair (e, t) belongs to w, where i = t(r) 
and e is the edge {x{t— ), x(r+)}. 

We choose L to be the smallest positive number such that j(L) — 7(0). Then L 
is the length of the loop; it corresponds to the sum of the vertical legs in Figure 
[I] (as for cycles) , but it is not a multiple of j3 in general (contrary to cycles) . We 
also make the loops semi-continuous by assigning the value 7(1") = 7(t— ) at the 
points of discontinuity. Identifying loops whose support is identical, to each ui 
corresponds a configuration of loops C{oj) whose supports form a partition of the 
cylinder V x [0,/3] pcr . The number of loops is |£(w)|. 

As we shall see, the relevant probability measures for the Heisenberg models 
(with h = 0) are proportional to 2^ c ^p £ j(duj) and 2^^ p £)(8 (do;). 

3.2. Duhamel expansion. We first state and prove Duhamel's formula. It 
is a variant of the Trotter product formula that is usually employed to derive 
stochastic representations. 

Proposition 3.1. Let A,B be n x n matrices. Then 

C A+B =C A + I' e tA Be (l-t)(A+B) dt 



(3.3) 







= J2 [ dh...dt k e^ A B e^~^ A B...B e^-^ A 

fe > -/0<ti<-<tjt<l 

Proof. Let F(s) be the matrix- valued function 

F(s) = e sA + f e^Be^^di. 
Jo 

We show that, for all s, 

eS (A+B) = F(s) _ (34) 

The derivative of F(s) is 

F'(s)= e sA A+ e sA B+ f e tA Be^''^' (A + B)dt = F(s){A + B). (3.5) 

Jo 

On the other hand, the derivative of e s(A+s) is e s(A+B) (A+B). The identity {O} 
clearly holds for s = and, since both sides satisfy the same differential equation, 
they must be equal for all s. 

We can iterate Duhamel's formula N times so as to get 



N 

c a+b = y dh...dt k e tlA B e^--^ A B...B e^ 1 "'* 

'o<ti<—<t k <i 



k=0 



/ dh...dt k e tlA Be {t2 - tl)A B ...B 

/0<ti <•••<* jv<1 



e (l-t N )(A+B) _ e (l-t N )A 



(3.6) 
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Using || e tA \\ < e*"' 4 " , the last term is less than 2 ell^H + H- 8 " and so it vanishes 

in the limit N — > oo. The summand is less than ell' 4 " , so that the sum is 
absolutely convergent. □ 

Our goal is to perform Duhamel's expansion on the Gibbs operator e~ l3HA - h , 
where the Hamiltonian is given by a sum of terms indexed by the edges and by 
vertices. The following corollary applies in this case. 

Corollary 3.2. Let A and (he), e G £, be matrices in W( v \ Then 

e ^+£ ee£ M = J Ap £ ,p{u) e tlA h ei e^~^ A h e2 ... h e „ e^-^ A , 

where (t\, e{), . . . , (t&, e&) are the bridges in uj. 

PROOF. We can expand the right side by summing over the number k of events, 
then integrating over < t\ < ■ ■ ■ < tk < ft for the times of occurrence, and then 
summing over edges e±, . . . , G E. After the change of variables t[ — ti//3, we 
recognize the second line of Proposition |3.1| □ 



3.3. Toth's representation of the ferromagnet. It is convenient to intro- 
duce the operator T XiV which transposes the spins at x and y. In % x ® T-L y , the 
operator acts as follows: 

T x>y \a,b) = \b,a), a,b = ±\. (3.7) 

This rule extends to general vectors by linearity, and it extends to by tensoring 
it with Idy\/ X y j. Using Lemma 2.1 it is not hard to check that 

Sx ' Sy — ~^T X y 4"^{z,y}' (3-8) 

Recall that C(ui) is the set of cycles of w, and let j x G C(uS) denote the cycle 
that intersects (a;, 0) G V x [0, /3] por . Let L(j) denote the (vertical) length of the 
cycle 7; it is always a multiple of (3 in the theorem below. 

Theorem 3.3 (Toth's representation of the ferromagnet). The partition function, 
the average magnetization, and the two-point correlation function have the following 
expressions. 



Zt m W,h)= e^l £ l fdpeA") II (2co8h(fcL( 7 ))), 

TrSi 3 >e- 20 <"° = ie-fl f l /d P£ ^H tanh(^L(7 x )) J] (2 cosh(/iL( 7 ))) , 

J iec(Lj) 



7 ec(u 

1 iflx=ly, 

tanh(/iL( 7a; )) tanh(/iL(7 2/ )) 1/7^ ^ 7l/ . 
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1_ 1 1 1 I 1 1 

2 2 2 2 ~2 2 2 




1 I I I L I I 

2 2 2 2 ~2 2 2 



Figure 2. Each cycle is characterized by a given spin. 
Proof. The partition function can be expanded using Corollary |3.2| so as to 

get 

Zf r °(2/3,/i) = e-^l Tr e ^M K+ ^ e T B ) 

= e"^ f d P£ , H^( S ( v )|e 2 * 1 ' lMA T ei ...r efc e 2 (' 3 - t '=" lMA | S ( v )), 
■* s (v) 

(3.9) 

where (ei,ii), . . . , (ek,tk) are the times and edges of lu. Observe that the vectors 
|s( v )) are eigenvectors of e tMA . It is not hard to see that the matrix element above 
is zero unless each cycle is characterized by a single spin value (see illustration in 
Figure [2]). If the matrix element is not zero, then it is equal to 

( s M\e 2t i hM *T ei ...T ek e 2 « 3 - t ^ hM *\sW) = [J e 2hL ^ s ^ (3.10) 

7£C(cj) 

with 5(7) the spin of the cycle 7. After summing over 5(7) = ±A, each cycle 
contributes e hL ^ + e~ /lL ' 7 - ) = 2cosh(/iL(7)), and we obtain the expression for 
the partition function. 

The expression which involves S x is similar, except that the cycle j x that 
contains ix{0} contributes \ e hL ^^ — i e - hL iy^) = smh{hL(j x )). Since the factor 
2 cosh (hL (7a;)) appears in the expression, it must be corrected by the hyperbolic 
tangent. 

(3) (3) 

Finally, the expression that involves S x S y has two terms, corresponding to 
whether (x, 0) and (y, 0) find themselves in the same cycle or not. In the first case, 
we get 2 cosh(hL(j xy )), but in the second case we get smh(hL(j x )) sinh(/iL(7 !/ )), 
which eventually gives the hyperbolic tangents. □ 

It is convenient to rewrite the cycle weights somewhat. Using 2 cosh(/iL(7)) = 
e hL(i) (1 + e -2h£(7) ) anc i £C , , L(j) = /3|V|, the relevant probability measure 
for the cycle representation can be written 

PZ^) = &r°W,h)- 1 e-^ + M v Up £ A<^) II (l+e- 2,liW ). (3,i) 
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This form makes it easier to sec the effect of the external field h > 0. Notice 
that the product over cycles simplifies to 2l c ("" when the external field strength 
vanishes (i.e. h = 0). Then, in terms of the cycle model, the expectation of the 
spin operators and correlations are given by 

(Si 3) ) A ,20,h = |E^ s (tanh(/ l L( 7K ))) (3.12) 

and 

(3.13) 

+ iKth [Kfrv tanh(/ l L( 7x )) tanh(M,( 7 ,))] . 
(3) 

In the case h = 0, we see that (S x }a, 2/3,0 = 0, as already noted from the spin flip 
symmetry, and 

<Si 3 >Sf) A , 20 ,O = \PZ^( lx = ly ). (3,4) 

That is, the spin-spin correlation of two sites x and y is proportional to the proba- 
bility that the sites lie in the same cycle. 

3.4. Aizenman-Nachtergaele's representation of the antiferromagnet. 

The antiferromagnetic model only differs from the ferromagnetic model by a sign, 
but this leads to deep changes. As the transposition operator now carries a negative 
sign in the Hamiltonian, one possibility would be to turn the measure corresponding 



to (3.1 1 1 into a signed measure, with an extra factor (— I) fe where k = k(uj) is 
the number of transpositions. That would mean descending from the heights of 
probability theory to... well, to measure theory. This fate can fortunately be 
avoided thanks to the observations of Aizenman and Nachtergaele pQ. 

Their representation is restricted to bipartite graphs. A graph is bipartite if 
the set of vertices V can be partitioned into two sets Va and Vb such that edges 
only connect the A set to the B set: 

{x, y} £ £ =*> (x, y) e V A x V B or (x, y) € Vb X V A . (3. 15) 

This class contains many relevant cases, such as finite boxes in Z d ; periodic bound- 
ary conditions are allowed provided the side lengths are even. In the following, we 
use the notation 

r isx J 1 if a; e V A , 

(-1 = \ , _ „ (316) 

1—1 it x € V B - 

Instead of the transposition operator, we consider the projection operator 



onto the singlet state described in Lemma [2. 1| Its action on the basis is 

P$\ a ,a} = 0, P<$\a,-a) = - 
for all a = ± | . Further, it follows from Lemma 



P$\a,a} = 0, pW\a-a) = \\a-a)-\\-a,a), (3.17) 



2.1 



that 



Q . Q — - TH r 1 — P( ) 

°i °V — 4 J -' J -\x,y\ r xv 



{x,y}~^xy (3- 18 ) 



Recall that C{oj) is the set of loops of u>. Let j x denote the loop that contains (x, 0). 
We do not need notation for the loops that do not intersect the t = plane. Also, 
it is not the lengths of the loops which are important but their winding number 

w(i). 
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Theorem 3.4 (Aizcnman-Nachtergaele's representation of the antiferromagnet). 
Assume that A is a bipartite graph. The partition function, the average magnetiza- 
tion and the two-point correlation function have the following expressions. 

ZT ti {2P,h)= e"il f l fdp e ,f,{w) J] (2cosh(/?^( 7 ))), 
J 7 e£(o)) 

TrS^e- 2 ^" = e~#l^l J Aps^u) tanh(^( 7x )) 

x \{ (2cosh(/3^( 7 ))), 
7 e£(w) 

TrSfsf e" 2 ^' =!(-in-l)^e-^l J dpeA") 

x TT (2cosh(^/ l w( 7 ))) x j 1 */7x=7y, 
7G£ | w) [tanh(/3/i«;( 7a: ))tanh(^/iu;( 7j/ )) ifi x +i v . 

When ft, = 0, we get the simpler factor 2' £ ^^. 



n 



\_ _j_ _j_ i i i j_ 

2 2 2 2 2 ~2 2 

Figure 3. Each loop is characterized by a given spin, but the 
values alternate according to whether the site belongs to the A or 
B sublatticc. 



Proof. As before, we expand the partition function using Corollary |3.2| and 
we get 

Zl nti (2/3, h) = e~^ £ I Tr e ^M A +T, e 2P ^ 



e 2 



f dpe !fj (u) Y,(s {V) \e 2tlhMA 2^ 0) ■ ■ ■ ^ e : 

** f\}\ 



2(/3-t h )hM A | s ( y )) 7 



where (ei,ti), . . . , (e/.,i/.) are the times and the edges of lu. Notice that 

e tM A | a cv)j = e t< s < v )|AfAk (v) ) | S (V)^ 

In Dirac's notation, the resolution of the identity is 

id V = E i* (v) ><* (v) i. 



(3.19) 



(3.20) 



(3.21) 
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We insert it on the right of each operator J'] ' 1 and we obtain 

Zr ti (2p,h) = e-f l £ l [dp e ,p(<») E e 2tl ''<.x 0;) IWA|.^> <a (V)| 2 pW| s (v) ) 

.(V) „(V) 

xe 2(t 2 -t 1 )/ l (4 V) I^Al4 V> > (4 V) |2FW|4 V) ) . . . (s^ V) |2P e (0) |s^ V) ) e 2(/9-t*)/»<4 V5 |M A |4 V) ) _ 

(3.22) 

Let us now observe that this long expression can be conveniently expressed in the 
language of loops. We can interpret oj and s£ , . . . , si as a spin configuration s 



in V x [0,/3] P er- It is constant in time except possibly at (ej,£j). By (3.171, the 
product 

<4 V) i4 v) > - - - <4 V) |2^i fe °> i4 v) ) 

differs from iff the value of (— l) x s(x,t) is constant on each loop (see illustration 
in Figure [3]). In this case, its value is ±1, as each bridge contributes +1 if the spins 
are constant, and —1 if they flip. Let us check that, in fact, it is always +1. If the 
bridge separates two loops with spins a and b, the factor is 

(-l) a ~ b = C ma . (3.23) 

Looking at the loop 7 with spin a, there is a factor e 17ra for each jump A— >B (of 
the form 1—1 ) and a factor e _17ro for each jump B— >A (of the form lj). Since there 
is an identical number of both types of jumps, these factors precisely cancel. 
The product 

e 2 tl h( s ( v) |M A | s < v) ) e 2(t 3 -ti)ft(4 V) |Af A |4 V) ) _ e 2(/3-t fc )^<4 V) I^Al4 V) > 

also factorizes according to loops. The contribution of a loop 7 with spin a is 
e 2/ l L A (7)a-2/ l L B (7)Q ; where L A ,L B are the vertical lengths of 7 on the A and B 
sublattices. We have 

La (7) _ £b(t) = (3.24) 
The contribution is therefore e 2 ^ hw h) a . Summing over a = ±|, we get the hyper- 
bolic cosine of the expression for the partition function of Theorem |3.4| 



(3) 

The expression that involves S x is similar; the only difference is that the 
loop that contains (x, 0) contributes (— l) x smh(/3hw(-f)) instead of 2cosh(/3hw('y)), 

(3) (3) 

hence the hyperbolic tangent. Finally, the expression that involves Sx Sy is 
similar but we need to treat separately the cases where (x, 0) and (y, 0) belong or 
do not belong to the same loop. □ 

4. Thermodynamic limit and phase transitions 

Phase transitions are cooperative phenomena where a small change of the exter- 
nal parameters results in drastic alterations in the properties of the system. There 
was some confusion in the early days of statistical mechanics as to whether the for- 
malism contained the possibility of describing phase transitions, as all finite volume 
quantities are smooth. It was eventually realized that the proper formalism involves 
a thermodynamic limit where the system size tends to infinity, in such a way that 
the local behavior remains largely unaffected. The proofs of the existence of ther- 
modynamic limits were fundamental contributions to the mathematical theory of 
phase transitions, and they were pioneered by Fisher and Ruelle in the 1960's; see 
for more references. 
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We show that the free energy converges in the t herm odynamic limit along 
a sequence of boxes in Z d of increasing size (Section 4.1). We discuss various 



characterizations of ferromagnetic phase transitions in Section |4.2[ and magnetic 



long-range order in Section 4.3 In Section 4.4 we consider the relations between 



the magnetization in the quantum models and the lengths of the cycles and loops. 

4.1. Thermodynamic limit. Despite our professed intention to treat arbi- 
trary graphs, we now restrict ourselves to a very specific case, namely that of a 
sequence of cubes in Z d whose side lengths tends to infinity. Since F A (f3, h) scales 
like the volume of the system, we define the mean free energy f A to be 

f A (0,h) = ^-F A (P,h). (4.1) 

We consider the sequence of graphs A„ = (V„, £„) where V n = {1, . . . , n} d and £ n 
is the set of nearest-neighbors, i.e., {x,y} £ £ n iff ||x — y|| = 1. 

Theorem 4.1 (Thermodynamic limit of the free energy). The sequence of functions 
(/a„ (P, h)) n >i converges pointwise to a function f(f3, h), uniformly on compact sets. 



Figure 4. The large box of size n is decomposed in k boxes of 
size to; there are no more than drn d ~ 1 remaining sites in the darker 
area. 



Proof. We consider the ferromagnetic model, but the modifications for the 
antiferromagnetic model are straightforward. We use a subadditive argument. No- 
tice that the inequality Tr e A+B > Tr e A holds for all self-adjoint operators A, B 
with B > 0. (This follows e.g. from the minimax principle, or from Klein's in- 
equality.) We rewrite the Hamiltonian so as to have only positive definite terms. 
Namely, let 

h x ,y — S x ■ S y + fid. (4.2) 

Then 

Z A (j3,h)= e-^l £ lTrexp(/3 £ h XlV + f3h £ fi& 3 >) . (4. 3) 

{x,y}££ xEV 

Let rn, n, k, r be integers such that n — km + r and < r < to. The box V„ is 
the disjoint union of k d boxes of size to, and of some remaining sites (fewer than 
drn d_1 ); see Figure El for an illustration. We get an inequality for the partition 
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function in A n by dismissing all h XtV where {x, y} are not inside a single box of size 
to. The boxes V m become independent, and 

k d 

ZA n W,h) > e-i^-l [Tr K( v m) exp(/3 £ h XtV + f3h £ S x 3 ^] 

{x,y}e£ m i6V m (4.4) 

= [Z Am (P 7 h)fe-^ £ ^ e kd ^ £ ^. 

We have neglected the contribution of eP hs * ' for x outside the boxes V m , which is 
possible because their traces are greater than 1. It is not hard to check that 

\En\ < k d \£ m \ + k d dm d - x + d 2 rn d - 1 . (4.5) 

We then obtain a subbaditive relation for the free energy, up to error terms that 
will soon disappear: 

, m ^ (km) d r ,„ 3k d dm d - 1 3d 2 r 

/A„ (P, h) < K -J~h m (P, h) + ^ + — . (4.6) 

Then, since - > 1 as 77 — > 00, 

lim sup / A (fj, h) < f A (fi, h) + ^~. (4.7) 

Taking the liminf over m in the right side, we see that it is larger or equal to the 
lim sup, and so the limit necessarily exists. 



Uniform convergence on compact intervals follows from Lemma 2.4 (which im- 
plies that (/a„) is equicontinuous) and the Arzela-Ascoli theorem (see e.g. Theorem 
4.4 in Folland" [21]). □ 

Corollary 4.2 (Thermodynamic limit with periodic boundary conditions). Let 

(AP° r ) be the sequence of cubes in 7L d of size 77 with periodic boundary conditions 
and nearest-neighbor edges. Then (fA^"(l3,h)) n >i converges pointwise to the same 
function f((3,h) as in TheoremUTn uniformly on compact sets. 



This follows from |/ap™(/3, h) — /a„ (/3, h)\ < f^, which is not too hard to prove, 
and Theorem HHJ 



4.2. Ferromagnetic phase transition. In statistical physics, an order pa- 
rameter is a quantity which allows one to identify a phase, typically because it 
vanishes in all phases except one. The relevant order parameter here is the mag- 
netization, which is expected to be zero at high temperatures and positive at low 
temperatures. There are actually three natural definitions for the magnetization; 
we show below that the first two are equivalent, and that the last one is smaller 
than the first two. 

• Thermodynamic magnetization. This is equal to (the negative of) the 
right-derivative of /(/3, h) with respect to h. We are looking for a jump 
in the derivative, which is referred to as a first-order phase transition. 

* , m ,. mh)-mo) 

™th(P) = - km . (4.8) 

h-yO+ a 

(The limit exists because / is concave.) 
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• Residual magnetization. Imagine placing the ferromagnet in an exter- 
nal magnetic field, so that it becomes magnetized. Now gradually turn 
off the external field. Does the system still display global magnetization? 
Mathematically, the relevant order parameter is 



m r 



s(0) 



lim 



lim inf 



f 



>A n ,/3,h- 



(4.9) 



(We see below that the lim inf can be replaced by the lim sup without 
affecting m* es . The limit over h exists because (M A „ 
a concave function, and it is therefore monotone.) 
Spontaneous magnetization at h = 0. Since (Ma) 
spin flip symmetry), we rather consider 



is the derivative of 



(because of the 



1 



TO s P (/3) = lim inf —t(|M a J)a„,/3,o- 

1 n— too 71 

Here, |M A | denotes the absolute value of the matrix M\. 



(4.10) 



A handier quantity, however, is the expectation of M\, which can be expressed in 
terms of the two-point correlation function, see below. It is equivalent to m* p in 
the sense that both are zero or both differ from zero: 



Lemma 4.3. 



\ |V| /A,P,Q - \V |V| J /A, ,3,0 

Proof. For th e firs t inequality, use |Ma| 



- 2\ |V| 



A,j8,0' 



| Ma | Id and then the Cauchy- 
Schwarz inequality (|2.24|. For the second inequality, observe that |Ma| < ||V|Id 

Ma |, and use the fact that the Gibbs state is a positive 

□ 



implies that M\ 
linear functional. 



< 



W\ 



f 




Figure 5. Qualitative graphs of the free energy /(/3, h) as a func- 
tion of h, for large (top) and /? small (bottom). 



Proposition 4.4. The three order parameters above are related as follows: 

Proof of m t * h = m* es . We prove that whenever /„ is a sequence of differen- 
tiable concave functions that converge pointwise to the (necessarily concave) func- 
tion /, then 

D+/(0) = lim lim sup f L(h) — lim Kminf fL(h). (4.11) 
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Up to the signs, the left side is equal to ml h and the right side to m* es and we 



obtain the identity in Proposition 4.4 The proof of (4.11 1 follows from the general 
properties 



limsup^inf ay] < inf (limsupaijj, 

i J J i 

liminf (supay) > sup(liminf a^), 

% 3 j 1 



(4.12) 



and from the following expressions for left- and right-derivatives of concave func- 
tions: 

f(h + s)-f(h) 



D- m = M f W-« h -°\ 

s>0 s 



D + f(h) = sup 

s>0 



(4.13) 



(4.14) 



With these observations, the proof is straightforward. For h > 0, 

D + f(0) > D-f(h) = inf limsup /kW - ^ - > limsup /^(/i) 

> liminf fjh) > sup liminf / " (fe + ' s) zluM = D+f(h). 

n— yoo s>0 n ~*°° S 

Since right-derivatives of concave functions are right-continuous, the last term con- 
verges to D+f(0) as ft -» 0+. This proves Eq. p^LTi □ 



PROOF of m* ea > \fn* p . Let h > 0, and let {<Pj} be an orthonormal set of 
eigenvectors of 1?a„,o an d M\ n with eigenvalues ej and rrij, respectively. Because 
of the spin flip symmetry, we have 



A„/A„,/3,/i 



rrin e 



-0ej ( pPhrrij 



> 



(4.15) 



After division by n d , we only need to consider those j with rnj ~ n d , in which case 
e -2/3hmj ^ o. We can therefore replace the parenthesis by 1 in the limit n — > oo. 
On the other hand, consider the function G n (h) = j logTr e -P H ^.o+Ph\M An \ . One 



can check that it is convex in h, see (2.22), so G' n {h) > G' n (0). Its derivative can 
be expanded as above, so that 

J2 \m j \e-P e ^ + f >h \ m ^ 



G'Jh) 



e -i ie 3+f ih \ m j\ 



(4.16) 



This is equal to twice the second line of (4.15) (without the parenthesis). Then 



m* Ies {f}) > 5 lim liminf \c' n {h) > § liminf -W n (0) = §m s * p (/3). 



(4.17) 



□ 



4.3. Antiferromagnetic phase transition. While ferromagnets favor align- 
ment of the spins, antiferromagnets favor staggered phases, where spins are aligned 
on one sublattice and aligned in the opposite direction on the other sub-lattice. The 
external magnetic field does not play much of a role. One could mirror the ferro- 
magnetic situation by introducing a non-physical staggered magnetic field of the 
kind h^2 xeV (— l) x si 3 \ which would lead to counterparts of the order parameters 
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m* h and m* es . We content ourselves with turning off the external magnetic field, 
i.e. setting h = 0, and with looking at magnetic long-range order. For x, y G V, we 
introduce the correlation function 

VA,p(x,y) = (-in-l) y (S^S^) A , Pt0 . (4.18) 

One question is whether 

W 2 (f3) = liminf — l — ^ <r\j(x,y) (4.19) 
™^°° |l/n| x, 9 ev„ 

differs from 0. A related question is whether the correlation function does not decay 
to as the distance between x and y tends to infinity. One says that the system 
exhibits long-range order if this happens. 

In Z d and for /3 large enough, it is well-known that there is no long-range order 
and that the correlation function decays exponentially in \\x — y\\. Long-range order 
is expected in dimension d > 3 but not in d = 1,2. This is discussed in more detail 
in Section 

4.4. Phase transitions in cycle and loop models. In this section, we 
clarify the relations between the order parameters of the quantum systems and the 
nature of cycles and loops. This yields probabilistic interpretations for the quantum 
results. We introduce two quantities, which apply simultaneously to cycles and 
loops. Recall that -f x denotes either the cycle that contains (x, 0) in the cycle 
model, or the loop that contains (x, 0) in the loop model. We write P and E for 
pcycics anc j gioop w j ien equations hold in both cases. 

• The fraction of vertices in infinite cycles/loops: 

Vao(P,h)= lim liminf -^tEa„ ,p,h {#{x € V„ : L{^ x ) > K}). (4.20) 

• The fraction of vertices in macroscopic cycles/loops: 

Wcro(/3, h) = lim liminf ~EA n ,B,h(#{z € V„ : L(j x ) > en d }). (4.21) 

It is clear that 77oo(/3,/i) > r) macTO (fl,h). These two quantities relate to magnetiza- 
tion and long-range order as follows. The first two statements deal with cycles and 
the third statement deals with loops. 

Proposition 4.5. 

(a) m r * es (2/?)>i lim r^fah). 

h— >0+ 

(b) m* sp (2(3) > ^ WcroCM) > 0. 

(c) a(2/3) > <f=> ((3,0) >0. 

Proof. Let 

m(2/3, h) = liminf (S^ ')A n apM- (4.22) 



We use tanhx > tanhi^ • 1 x >k, which holds for any K, and Theorem |3.3| so as to 

get 

m(2/3, h) > i t&nh(hK) liminf P^ c ^ ?i (L(7 ) > K). (4.23) 

Taking K — > 00, we get m(2/3, h) > |??oo(/3j h). We now take h —> 0+ to obtain (a). 
For (b), we observe that, since the vertices of A„ are exchangeable, 
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It follows from Lemma \4. 31 that 

m: p (2/?) > <=► liminf E^ (^) > 0. ( 4 , 5 ) 
On the other hand, we have 

?7macro(/3,0) = lim lim inf P^ l f ( L 3 > £ Y ( 4 - 26 ) 

The result is then clear. 

The claim (c) is identical to (b), with loops instead of cycles. □ 



It should be possible to extend Proposition 4.5 (a) so that m* es (/3) > 
Voo(P>0) > 0. This suggests that m* h and m* es are related to the existence of 
infinite cycles, while m* p is related to the occurrence of macroscopic cycles. The 
question is then whether there exists a phase in which a positive fraction of vertices 
belongs to mesoscopic cycles or loops. Such a phase could have something to do with 
the Berezinskh-Kosterlitz-Thouless transition [7J 1361 . which has been rigorously 
established in the classical XY model |25j . It is not expected in the Heisenberg 



model, though. The Mermin- Wagner theorem (Section 5.1| rules out any kind of 
infinite cycles or loops in one and two dimensions. 

5. Rigorous results for the quantum models 

Quantum lattice systems have seen a considerable amount of study in the past 
decades, and the effort is not abating. Physicists are interested in properties of the 
ground state (i.e., the eigenvector of the Hamiltonian with lowest eigenvalue), in 
dynamical behavior, and in the existence and nature of phase transitions. Out of 
many results, we only discuss two in this section, which have been chosen because 
of their direct relevance to the understanding of the cycle and loop models: the 
Mermin- Wagner theorem concerning the absence of spontaneous magnetization in 
one and two dimensions, and the theorem of Dyson, Lieb, and Simon concerning 
the existence of long-range order in the antiferromagnetic model. 

5.1. Mermin- Wagner theorem. This fundamental result of condensed mat- 
ter physics states that a continuous symmetry cannot be broken in one and two 
dimensions [38 . In particular, there is no spontaneous magnetization or long-range 
order in Heisenberg models. 

Theorem 5.1. Let (AP er )„>i be the sequence of cubic boxes in 1 d with periodic 
boundary conditions. For d = 1 or 2, and for any (3 £ [0, oo), 

m rc S W) = 0- 



By Proposition 4.4 all three ferromagnetic order parameters are zero, and there 
are no infinite cycles by Proposition |4.5| in the cycle model that corresponds to the 
Heisenberg ferromagnet. The theorem can also be stated for the staggered magnetic 
field discussed in Section [4~3] One could establish antiferromagnetic counterparts 



to Lemma 4.3 and Proposition 4.4 and therefore prove that ?7oo(/3) is also zero in 
the loop model that corresponds to the Heisenberg antiferromagnet. 

An open question is whether the theorem can be extended to more general 
measures of the form 

■& lC{u)l dp s ,p(u) and ^ c ^dp e ,p(u) 
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(up to normalization), for values of $ other than d = 2. The case 3' £ ^' can actually 
be viewed as the representation of a model with Hamiltonian — y }e£^ x ' 
(see pQ) and the Mermin- Wagner theorem certainly holds in that case. 

The theorem may not apply when $ is too large, and the system is in a phase 
with many loops, similar to the one studied in (12] . 

We present the standard proof [44j that is based on Bogolubov's inequality. 



Proposition 5.2 (Bogolubov's inequality). Let (3 > and A,B,H be operators on 
a finite- dimensional Hilbert space, with H self-adjoint. Then 

\Tr[A,B]e- 0H \ 2 < ^(3Tr (AA* + A* A) e~ m Tr [[B,H],B*] e~ m . 

Proof. We only sketch the proof; see |44] for more details. Let {ifi} be an 
orthonormal set of eigenvectors of H and {e{\ the corresponding eigenvalues. We 
introduce the following inner product: 



Q~P e 3 — e ~P e i 

(A,C)= (fu-A*V>i)(<Pj,0(pi) 

... e% Cj 



(5.1) 



One can check that 

(A, A) < |/3Tr (AA* + A* A) e~^ H . (6 .2) 
We choose C = [B*,H], and we check that 

Tc[A,B]e- pH = (AC) (5.3) 

and 

Tr[[B,H],B*]e-P H = (C,C). (6.4) 



Inserting (5.3) and (5.4) in the Cauchy-Schwarz inequality of the inner product 
(5.1), and using (5.2), we get Bogolubov's inequality. □ 

Proof of Theorem 15. 11 Let m n (/3,h) = n~ d (M Ari ) Aritj3th . Let 

S^ = ^(S^±iS^). (5.5) 

One easily checks that 

[Si+\S^] = Sj^6 x , v . (5.6) 
It is convenient to label the sites of AP er as follows 

V n = {x G Z d : -§ < Xi < f ,1 = 1, . . . , d}. (5.7) 

£ n is again the set of nearest-neighbors in V ra with periodic boundary conditions. 
For k G v"V n , we introduce 



n d/2 / . ' "' ■ 



x£V n 

where kx denotes the inner product in M. d . Then, using (|5.6 



([S(+)(t),SH(-i)K lW =i e~ ikx e iky ({Si +) >S { y - } })A n ,p, h 

= m n (f3,h) 



n 

x,y£V n (5.9) 
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This will be the left side of Bogolubov's inequality. For the right side, tedious but 
straightforward calculations (expansions, commutation relations) give 

([[SW(k),H An ],S(-\-k)]) An ^ h 
Despite appearances, this expression is real and positive for any fc, as can be seen 



from (5.4 1. We get an upper bound by adding the same quantity, but with — k. 



This yields 



^ J2 (l-cosk(x-y))(S^S^+S^S^) An ^ h + 2hm n (p,h). 



From Lemma |2.1[ wc have 

\(S^S^ + S^S^) An J = \(S X - 4) An , M | < |. ( 5 .ii) 
Let us now introduce the "dispersion relation" of the lattice: 



e(k) = ^^(1 — cos ki). 



(5.12) 



i=l 

Inserting all of this into Bogolubov's inequality, we get 

\2 



(5.13) 



Summing over all k € V n , and using e lk ^ x yS> — 5 x ,y, we have 

k 

Then 



k£ V n 

As n — > oo, we get a Riemann integral, 

, , 2 1 f dk 
m ^ H) (2^F Hk) + 2\ hmn ^h)\ * * 

Since e(k) rs fc 2 around fc = 0, the integral diverges when ft, — > 0, and so m(/3, h) 
must go to 0. □ 

Notice that the integral remains finite for d > 3; the argument only applies to 
d = l,2. 
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5.2. Dyson-Lieb-Simon theorem of existence of long-range order. Fol- 
lowing the proof of Frohlich, Simon and Spencer of a phase transition in the classical 
Heisenberg model |24j , Dyson, Lieb and Simon proved the existence of long-range 
order in several quantum lattice models, including the antiferromagnetic quantum 
Heisenberg model in dimensions d > 5 [17] , Further observations of Neves and 
Perez |40j , and of Kennedy, Lieb and Shastry [32j , imply that long-range order is 
present for all d > These articles use the "reflection positivity" method, which 
was systematized and extended in |22l 123] . We recommend the Prague notes of 
Toth |49j and Biskup |10j for excellent introductions to the topic. See also the 
notes of Nachtergaele |39] . 



Recall the definition of a in Eq. (4.19) 



Theorem 5.3 (Dyson-Lieb-Simon). Let (AP cr ) be the sequence of cubic boxes in 
1 d , d > 3, with even side lengths and periodic boundary conditions. There exists 
/3o < oo such that, for all f3 > fto, the Heisenberg antiferromagnet has long-range 
order, 

*(P) > 0. 

Clearly, this theorem has remarkable consequences for the loop model with 
weights 2l £ ( w 'L Indeed, there are macroscopic loops, r] macro (/3, 0) > 0, provided 
that (3 is large enough. 

Despite many efforts and false hopes, there is no corresponding result for the 
Heisenberg ferromagnet, and hence for the cycle model. 



The proof of Theorem 5.3 for d > 5 can be found in |17j (see also |22L 149] 
for useful clarifications). In the remainder of this section we explain how to use 
the observations of |40j and |32j in order to extend the result to dimensions d = 3 
and d = 4. As these articles deal with ground state properties rather than positive 
temperatures, some modifications are needed. We warn the readers that this part 
of the notes is not really self-contained. 



Recall the definitions of the operators S^''(k) in Eq. (5.8 1. We need the 
Duhamel two-point function, which is reminiscent of the Duhamel formula of Propo- 
sition 13.11 

(S^(k),S (j) (-k)) A0o = ry h n > / e- s0HK <° S U) (k) e -(i-»)/»*A.o S^(-k)ds. 

^a(p, 0) J 

(5.17) 

Recall also the definition of e(k) in ( |5.12[ ), and let n = (n, . . . , 7r) € R d . We have 



S(k — 7T ) = + COS fcj). 



(5.18) 



i=l 



Let e n (/3) denote the negative of the mean energy per site, i.e., 

en{fi) = -(— ^\ . (5.19) 

One can show that e„(/3) is nonnegative, increasing with respect to j3, and that it 
converges pointwise as n — > oo. 

The main result of reflection positivity is the following "Gaussian domination" . 

Proposition 5.4. If k £ ^fV„ and k ^ ir, we have 



^We are indebted to the anonymous referee for pointing this out and for clarifying this to us. 
The following explanation is essentially taken from the referee's report. 
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(a) (S^(k),S^(-k)) An , p , < _ 



(k-n)J 2j3£(k-w) 

Sketch proof. The claim (a) can be found in [IT], Theorem 6.1. The claim 
(b) follows from Eqs (3), (5), and (6) of [40 , and from the relation 

3 

J2([S ij) (k),[H A , o ,SW(-k)]]) At0o = js(k)e n (P). (5 .20) 

This is Eq. (55) in [T7]. □ 
Next, let 

^) = i E (-in-lYiS^S^A^o. (5.2i) 

Then tr(/3) = liminf„ a n (f3), and the goal is to show that it differs from zero. For 
k = 7?, we have 

(S (3) (t?)S (3) (-7?))a,0,O = n d a n (P). (5.22) 

Kennedy, Lieb and Shastry [32 a have proposed the following sum rule, which im- 
proves on the original one used in [241 117] : 

~ E (^ (3) (fc)^ (3) (-fc))A,/3,0COsfc l = (4 3) 5(f) A: , 3i0 , (5.23) 

where denotes the neighbor of the origin in the ith direction. Because of the 
symmetries of V n (translations and lattice rotations), we have 



/q(3)q(3)\ e »(/3) 

Wo b ei MAO = ^7-- 



3d 

The sum rule can be rewritten as 

e n 03) _ 



, )(/ + i E (^ (3) (fc)^ (3) (-fc))A^,o(-E cosfc > 



(5.24) 



(5.25) 



d 

cos k, 



(5.26) 



By Proposition 5.4 (b), we have 

ejj?) <7f , m , e n (P) 1/2 1 N - / g(fc) y/y v - 

3d S (6d)V2 d „d 2^ U(fc-7f)J V 2^ 

+ E £ ( k _ jf) (-E cosfc *) + - 

fee^fv„ »=i 
As n — )• oo, with e(/3) = lim„ e„(/3), we get 

e(/?) < a{B)+ ?^)!!o, JL /A., I n 1 ^ ( ~Y COS ki) dfc, (5.27) 



3d - (6d)!/2rf ^ > 2df3 (27r) d J^,^ e(* - tt) 
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where 

1(d) = ,„ , ■ / [—71 ) (— > COSfci) dfc. (5.28) 

v ; (27r)««y [ _ ir)T ]Ae(fc-7?)V V ^ V + 

The last integral in ( 5.27[ ) is finite when c? > 3, and this term may be made arbi- 
trarily small by choosing j3 large enough. It follows that a sufficient condition for 
<j(/3) > for large enough (3, is that 

ef/3) 1 / 2 1 

^^ d - > wm I{d) - 

The integral 1(d) can be calculated numerically: 1(3) = 1.04968... and 1(4) = 
1.01754... It is then enough to show that lhm3_ i . 00 e(/3) > 0.5509... in d = 3 and 
lim^oo e(f3) > 0.3883... in d — 4. The following lemma allows us to conclude that 
long-range order indeed takes place in d = 3 and d = 4. 

Lemma 5.5. 

lim e(B) > -. 
,3^00 ~~ 4 

Proof. The Gibbs variational principle states that 

Fa(P, h) < Tr pH A ,h - ^Sa(p) (s.so) 

for any operator p in %( v ) such that p > and Tr p = 1. Here, 5a is the Boltzmann 
entropy 

Sa(p) = -Trplogp. (5.31) 
See e.g. Proposition IV. 2. 5 in |46j (the setting in |46j involves a normalized trace, 
hence there are a few discrepancies between our formula? and those in the book). 
It is known that the Gibbs state p = Z\(/3, h)~ x e~^ HA - h saturates the inequality, 
and that the entropy satisfies the bounds 

0<S A (p)<|V|l O g2. (5.32) 

It follows that 

e(^)>-/(A0)-^. (5.33) 

In order to get a bound for the free energy, we use (5.30) with the Neel state V^Neci 
as a trial state, 

*Ncel- ® |(-1)*5>. (5-34) 
iGA„ 

With p the projector onto ^Noob we have S\ n (p) = 0, and 

^Aj/3,0) < (*N6cl,ffA„,0l , Neol) 



(5.35) 

2 ■ I !" '■' ' '!' ' 2 ' 2l ' 



dn ( „ , 9 1 S x ■ $, ' 



The last inner product is in T-L X ®T-L y . Using (3.8 1, we find that it is equal to — 4. □ 



These results do not apply to dimension d = 2 because the last integral in 



(5.27) is divergent. We already know that the magnetization is zero for all finite 
values of f3 by the Mermin- Wagner theorem. An important question, which remains 
open to this day, is whether long-range order occurs in the ground state of the two- 
dimensional antifcrromagnct. The last integral in (5.27) disappears if the limit 
/3 — > 00 is taken before the infinite volume limit, and the question is whether (5.29) 
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is true. Since 1(2) = 1.29361... one needs linig^oo e(/3) > 1.255.... But the limit is 
expected to be around 0.67 [32 and so the method does not apply. 

In contrast to the antiferromagnet, the ground state of the ferromagnet is triv- 
ial with full magnetization. If /3 is taken to infinity in the cycle model for a fixed 
graph, the spatial structure is lost and the resulting random permutation has Ewens 
distribution (that is, it is weighted by 2' c l). Almost all vertices belong to macro- 
scopic cycles and the cycle lengths are distributed according to the Poisson-Dirichlct 
distribution PD2. 



6. Rigorous results for cycle and loop models 



The cycle and loop representations in Theorems |3.3| and |3.4| are interesting in 
their own right and can be studied using purely probabilistic techniques. Without 
the physical motivation, the external magnetic field is less relevant and more of 
an annoyance. We prefer to switch it off. The models in this simpler situation 
are defined below, with the small generalization that the geometric weight on the 
number of cycles or loops is arbitrary. This is analogous to how, for example, 
one obtains the random cluster or Fortuin-Kasteleyn representation from the Ising 
model. 

6.1. Cycle and loop models. As usual we suppose that A = (V, £) is a 
finite undirected graph. Recall that the Poisson edge measure p£ p is obtained by 
attaching independent Poisson point processes on [O,0\ to each edge of £. 

For each realization u> of the Poisson edge process, we define cycles C{uS) and 



loops C(uj) as in J 3.1 The random cycle and loop models are obtained via a change 
of measure in which the number of cycles or loops receives a geometric weight 1? > 0. 
That is, the probability measures of interest are 

where Z'^(J$) are the appropriate normalizations. As remarked above, •& = 2 is the 
physically relevant choice in both these measures. 

The main question deals with the possible occurrence of cycles or loops of 
diverging lengths. Recall the definitions of the fraction of vertices in infinite cy- 
cles/loops, 7700 (/?), and the fraction of vertices in macroscopic cycles/loops, 



?7macro(/3), which were defined in Section 4.4 (We drop the dependence in h, since 
h = here.) In the case where the graph is a cubic box in 7L d with periodic bound- 
ary conditions, and 1? = 2, the Mermin- Wagner theorem rules out infinite cycles in 
one and two dimensions, and the theorem of Dyson-Lieb-Simon shows that macro- 
scopic loops are present in d > 3, provided that the parameter (3 is sufficiently 
large. 

It is intuitively clear that there cannot be infinite cycles or loops when /3 is 
small. In Section |6.2| we prove this is indeed the case and give an explicit lower 
bound on the critical value of /3. 

The model for <& = 1 is known as random stirring or the interchange process. 
The question of the existence of infinite cycles in this setting has been considered 
by several authors. Angel considered the model on regular trees, and proved the 
existence of infinite cycles (for j3 lying in an appropriate interval) when the degree of 
the tree is larger than 5 [4] . Schramm considered the model on the complete graph 
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and obtained a fairly precise description of the asymptotic cycle length distribution 



We review this important result in Section 6.3 Recently, Alon and Kozma 
found a surprising formula for the probability that the permutation is cyclic, using 
representation theory [3]. 

6.2. No infinite cycles at high temperatures. We consider general graphs 
A = (V, £). We let k denote the maximal degree of the graph, i.e., k = sup^gy \{y : 
{x, y} € £}\. Recall that L(p/ X ) denotes the length of the cycle or loop that contains 
x x {0}. Let a be the small parameter 



i?- x (l-e-^) if i9 < 1, 
l-e-^ if#>l. 



(6.2) 



in the case of cycles and 



(6.3) 



_\$- x {l- b-P) if < 1, 
a ~\e-^(e^ -1) if i? > 1. 

in the case of loops. 

Theorem 6.1. For either the cycle or the loop model, i.e., for either measure in 



(6.1), we have 

pa !/3 (£(7x) > i3k) < (o(k - ^rva - $)- K+1 ] k . 

for every x 6 V. 

Of course, the theorem is useful only if the right-hand side is less than 1, in 
which case large cycles have exponentially small probability. This result is pretty 
reasonable on the square lattice with i9 < 1. When d > 1, configurations with many 
cycles are favored, and the domain should allow for larger j3. Our condition does 
not show it. The case t3 3> 1 is close to the situation treated in |12j with phases of 
closely packed loops. In the case of the complete graph on N vertices and i? = 1, the 
maximal degree is n = N — 1 and the optimal condition is f3 < 1/N (Erdos-Renyi, 
|18j ). Using an < f3N and (1 — < e,we see that our condition is off by a 

factor of e . 

As a consequence of the theorem, we have ?7oo(/?) = for small enough j3. This 
implies that Tn*(j3) = W(j3) = in the corresponding Heisenberg ferromagnet and 
antiferromagnet. One could extend the claim so that m* b ({3) = as well. 

Proof. Given u, let G(oj) = (V,E) denote the subgraph of A with edges 

E = {a: (ei,U) e uj}, (6.4) 

and V — Uiei the set of vertices that belong to at least one edge. G(w) can be 
viewed as the percolation graph of uj, where an edge e is open if at least one bridge 
of the form (e,t) occurs in w. Then we denote C x (u>) — (V X ,E X ) the connected 
component of G(uj) that contains x. It is clear that L(-f x ) < f3\V x \ for both cycles 
and loops. Then, using Markov's inequality, 

PaA l (1x) > Pk) < Pa,/j(|^x| > k) < a- k E A ^{a^), (e.s) 

for any a > 1. 

We consider first the case of cycles. Given a subgraph G' = (V , E') of A, let 
4>{G') = 0-W'\ [ 1 [G(W ) =G ,] fiVWdp&Au)- < 6 - 6 ) 
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By partitioning according to the connected components of G(u>), then using 
the fact that p£^ is a product measure over edges and that cycles are contained 
entirely within connected components, we have 

^cycles, |V'|\ ST^Afr»\ W'\ ^G'nC;=0 ^(^ ) 



Eg- 4>(G") 



The first sum is over connected subgraphs C' x = (V x , E' x ) of A that contain x. 
The second sum is over subgraphs G' = (V',E') that are compatible with C' x , in 
the sense that V n V' x = and V' U C' x = V. The sum in the denominator is over 
all subgraphs G" = (V" , E") with V" = V. 

Notice that for any C' x , the corresponding compatible graph G' = (V , E') can 
be enlarged to G" = (V,E') by adding the vertices from V' x . The new vertices from 
V x are all disconnected in G" . Thus, if G(ui) = G", each vertex in V x necessarily 
forms a single cycle of length 1. It follows that 4>(G") = 4>(G'). Furthermore, 



different G" give rise to different G" . So, the ratio in (6.7) is less than 1. 
Now we claim that 

0(G') < 0} E '\ (6.8) 

for any connected G' . First consider ■& < 1. Since G' is connected we have \E'\ > 

\V'\ - 1. So, tf-m+IGMI < ^-|V|+i < $-\E'\ for any w _ when d > 1; use 
|C(w)| < \V'\ to see #-\v'\+\c(u)\ < L 

On the other hand, G(w) = G' holds if and only if the Poisson process for each 
edge of G' contains at least one point. So, 

l[G(w)=G'] dpE',p( u ) = C 1 ~ 6-/3 )' £ < 6 - 9 ' 



and (6.8) follows in the case of cycles. 

The s ame bound also holds for the loop model when $l c (")l is replaced by 



m (6.6). For ■& < 1 the argument is the same as before. For i? > 1, we use 
the inequality |£(w)| < \V'\ + \ui\ that holds for any u>, where \ui\ is the number of 
bridges in w. This follows from the fact that each bridge in cj cither splits a loop 
into two or merges two loops (see Lemma 8.1 ), and that |£(w)| = \ V'\ when u = 0. 
Hence, 

0(G') < J *M t { G { „)=G>]&PE>Au) = (e"' 3 E '. ^ 



which gives the bound (6.8) for loops 



Combining (16. 71 and (16.81) shows that for either loops or cycles, 



ur ( WA\ ~ST^xrr<i\ W'\ ScnCi=0 ) iyi 



(6.11) 



Let <5(G^.) denote the "depth" of the connected graph C' x , i.e., the minimal number 
of edges of E' x that must be crossed in order to reach any point of V x . Let 

B{£) = a\ y '\a\<\. (6.i2) 

C' X ,S{C> X )<Z 

We want an upper bound for B(£) for any £. We show by induction that B(£) < b 
for a number b to be determined shortly. We proceed by induction on £. The case 
£ = is a < b. For I + 1, we write the sum over graphs with depth less than £ + 1, 
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attached at x, as a sum over graphs of depth less than I, attached at neighbors of 
x. Neglecting overlaps gives the following upper bound: 

B(£+l)<a Yl (l + a a'^'a 1 ^ 1 ) 

y.{x,y}eS C' y ,S(C' y )<e (6.13) 

< a(l + ab) K . 

This needs to be less than b; this condition can be written a < b~ 1 ((b/a) 1 l K — 1). 
The optimal choice that maximizes the possible values ofais& = a(l — ^)~ K . A 
sufficient condition is then 

a<^-(l-i) K - 1 (6.i4) 

We have obtained that 

PaAHJx) > I3k) < a- k+1 (l - (6.i5) 

and this holds for all 1 < a < — (1 — Choosing the maximal value for a, 

we get the bound of the theorem. □ 

6.3. Rigorous results for the complete graph. Suppose Ti, T 2 , T 3 , . . . are 

independent random transpositions of pairs of elements of {1, 2, . . . , n} and tt^ = 
T\ o T2 o . . . o Tfc. Write \(irk) for the vector of cycle lengths in TTk, sorted into 
decreasing order. So, \i(irk) is the size of the i th largest cycle and if there are fewer 
than i cycles in irk, we take Xi(iTk) = 0. 

Note the simple connection between cycles here and the cycles in our model; if 
N is a Poisson random variable with mean /3n(n — 1)/2, independent of the Tj, then 
A(7rjv) has exactly the distribution of the ordered cycle lengths in C under pK n ,p, 
where K n is the complete graph with n vertices. 

Schramm proved that for c > 1/2, an asymptotic fraction rj^ = r] 00 (2c) of 
elements from {1,2, ... ,n} lie in infinite cycles of 7r^ c „j as n — > 00. The (non- 
random) fraction r?oo(2c) turns out to be the asymptotic fraction of vertices lying 
in the giant component of the Erdos-Renyi random graph with edge probability 
c/n. Equivalently, ?7oo(s) is the survival probability for a Galton- Watson branching 
process with Poisson offspring distribution with mean s. Berestycki [6] proved a 
similar result. 

Furthermore, Schramm also showed that the normalised cycle lengths converge 
to the Poisson-Dirichlet(l) distribution. 

Theorem 6.2 (Schramm 45j). Let c > 1/2. The law of \{-Ky cn ^) / {nri oa {2c)) 
converges weakly to PDi as n — ¥ 00. 

7. Uniform split-merge and its invariant measures 

We now take a break from spin systems and consider a random evolution on 
partitions of [0, 1] in which blocks successively split or merge. Stochastic processes 
incorporating the phenomena of coalescence and fragmentation have been much 
studied in the recent probability literature (see, for example, [H [8] or Chapter 
5 of [42], and their bibliographies). The space of partitions of [0,1] provides a 
natural setting for such processes. The particular model we will discuss here has 
the property that the splitting and merging can be seen to balance each other out 
in the long run, so that there exists a stationary (or invariant) distribution. Our 
aim is to summarise what is known about this invariant distribution. Only a basic 
familiarity with probability theory is assumed and we will recall the essentials as 
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we go. This section is self-contained and can be read independently of the first. 
As is the way among probabilists, we assume there is a phantom probability space 
(SI, F, P) that hosts all our random variables. It is summoned only when needed. 

7.1. Introduction. Let Ai denote the space of (decreasing, countable) par- 
titions of [0, 1]. Formally 

Ai := [p e [0, If : Pl > P2 > . . . , = i}. ( 71 > 

i 

where the size of the i th part (or block) of p e Ai is Pi . We define split and merge 
operators Sf, M t j : A x — > Ai, u e (0, 1) as follows: 

• Sfp is the non-increasing sequence obtained by splitting pi into two new 
parts of size upi and (1 — u)pi, and 

• Mijp is the non-increasing sequence obtained by merging pi and pj into a 
part of size + pj . 



2 4 




Figure 6. Illustration for the split-merge process. The partition 
undergoes a merge followed by two splits and another merge. 

The basic uniform split-merge transformation of a partition p is defined as 
follows. First we choose two parts of p at random, with the i th part being chosen 
with probability Pi (this is called size-biased sampling). The two parts, which we 
call pj and pj, are chosen independently and we allow repetitions. If the same part 
is chosen twice, i.e. I — J, sample a uniform random variable U on [0, 1] and split 
pj into two new parts of size Upj and (1 — U)pj (i.e. apply Sf). If different parts 
are chosen, i.e. I =/= J, then merge them by applying Mjj. This transformation 
gives a new (random) element of A^ Conditional on plugging a state p € Ai into 
the transformation, the distribution of the new element of Ai obtained is given by 
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the so-called transition kernel 

K (p,-)~y2Pi Ss% P {-)du + y^piPj5 Mi:i p{-)- (7.2) 

Repeatedly applying the transformation gives a sequence P = {P k )k=v,i,2,... 
of random partitions evolving in discrete time. We assume that the updates at 
each step are independent. So, given P k , the distribution of P k+1 is independent 
of P k ~ 1 , . . . ,P°. In other words, P is a discrete time Markov process on Ai with 
transition kernel K. We call it the basic split-merge chain. 

Several authors have studied the large time behaviour of P, and the related 
issue of invariant probability measures, i.e. [i such that fj,K = /i (if the initial value 
P° is distributed according to fx, then P k also has distribution given by [i at all 
subsequent times k = 1, 2, . . .). 

Recent activity began with Tsilevich [51j . In that paper the author showed that 



(7.3) 



the Poisson-Dirichlet^) distribution (defined in f 7.2 below and henceforth denoted 
PDg) with parameter 9 = 1 is invariant. The paper contains the conjecture (of 
Vershik) that PDi is the only invariant measure. 

Uniqueness within a certain class of analytic measures was established by 
Mayer- Wolf, Zerner and Zeitouni in |37j . In fact they extended the basic split- 
merge transform described above to allow proposed splits and merges to be re- 
jected with a certain probability. In particular, splits and merges are proposed as 
above but only accepted with probability f3 s £ (0, 1] and (3 m £ (0, 1] respectively, 
independently at different times. The corresponding kernel is 

Kp e ,/3 m (p, ') ==& YV 2 / S S u p (-)du + m y2piPjd MiiP (-) 

< Jo m 

} ~ AX!?" ~ Pm^PiPjjSpi-). 

i i^j 

We call this (j3 s , /3 m ) split-merge (the basic chain, of course, corresponds to (3 S = 
f3 m = 1). The Poisson-Dirichlet distribution is still invariant, but the parameter 
is now 9 = /3 s /j3 m (note that, in fact, any invariant distribution for the chain can 
depend on j3 s and f3 m only through 9 since multiplying both acceptance probabilities 
by the same positive constant only affects the speed of the chain). 

Tsilevich |50j provided another insight into the large time behaviour of the the 
basic split-merge process (f3 s — /3 m = 1). The main theorem is that if P° = 
(1,0,0,...) <E Ai, then the law of P, sampled at a random Binomial(n, 1/2)- 
distributed time, converges to Poisson-Dirichlet (1) as n — > oo. 

Pitman [41] studied a related split-merge transformation, and by developing 
results of Gnedin and Kerov, reproved Poisson-Dirichlet invariance and refined the 
uniqueness result of |37; . In particular, the Poisson-Dirichlet distribution is the only 
invariant measure under which Pitman's split-merge transformation composed with 
'size-biased permutation' is invariant. 

Uniqueness for the basic chain's invariant measure was finally established by 
Diaconis, Mayer- Wolf, Zerner and Zeitouni in [16] . They coupled the split-merge 
process to a discrete analogue on integer partitions of {1, 2, . . . , n} and then used 
representation theory to show the discrete chain is close to equilibrium before de- 
coupling occurs. 



HEISENBERG MODELS AND THEIR PROBABILISTIC REPRESENTATIONS 209 



Schramm [45] used a different coupling to give another uniqueness proof for 
the basic chain. His arguments readily extend to allow f3 s /f3 m e (0, 1]. In summary, 

Theorem 7.1. 

(a) Poisson-Dirichlet(f3 s / /? m J is invariant for the uniform split-merge chain 

mth/3 s ,p m e (0,1]. 

(b) If [3 S / (3 nl < 1, it is the unique invariant measure. 



We give a short proof of part (a) in Section 7.3 below 



7.2. The Poisson-Dirichlet distribution. Write A^i(Ai) for the set of 

probability measures on Ai. The Poisson-Dirichlet distribution PDg 6 A^i(Ai), 
9 > 0, is a one parameter family of laws introduced by Kingman in [33j . It has 
cropped up in combinatorics, population genetics, number theory, Bayesian statis- 
tics and probability theory. The interested reader may consult [201 1341 \5\ 143] for 
details of applications and extensions. We will simply define it and give some basic 
properties. 

There are two important characterizations of PDg. We will introduce both, 
since one will serve to provide intuition and the other will be useful for calcula- 
tions. We start with the so-called 'stick-breaking' construction. Let T\,T2,. be 
independent Beta(l, 6) random variables (that is, p(Tj > s) = (1 — s) ; if U is uni- 
form on [0, 1], one can check that 1 — U 1 / 9 is Beta(l, 8) distributed). Form a random 
partition from the Tj by letting the k th block take fraction of the unallocated 
mass. That is, the first block has size P\ = T±, the second Pi = Tz{\ — P%) and 
Pk+i = Tfc + i(l — P\ — . . .— Pfe). One imagines taking a stick of unit length and break- 
ing off a fraction Tk+i of what remains after k pieces have already been taken. A 
one-line induction argument shows that 1— Pi — . . .— Pfc = (1— Ti)(l— T2) • ■ • (1— Pfe), 
giving 

P fe+1 = T fe+1 (l - Pi)(l - T 2 ) ... (1 - T fc ). (7.4) 
In case it is unclear that P, = 1 almost surely, note that 

k k 



E 



=E[n( i - r o] =(/ 0t(i-*) 9-1 ) =(o+ir k ^o 

i=l i=l ^° 



(7.5) 



as k — > 00. So, the vector (Pm, Ppi) • ■ ■) of the Pj sorted into decreasing order is an 
element of A x . It determines a unique measure PD# £ A^i(A!). It is interesting 
to note that the original vector (Pi, P 2 , . . .) is obtained from (Pm , Ppi , ■ ■ ■) by size- 
biased re-ordering; its distribution is called the GEM (Griffiths-Engen-McCloskey) 
distribution. In other words, consider the interval [0, 1] partitioned into lengths 
(Pi], Ppi, . • .). Take a sequence U\,Ui,... of i.i.d. uniform random variables on 
[0,1]. Now list the blocks "discovered" by the uniforms in the order that they are 
found. The resulting sequence has the same distribution as (Pi,P2, . . .). 

7.2.1. Poisson Point processes. Kingman's original characterization of PDg was 
made in terms of a suitable random point process on R + , which is a generalization 
of the usual Poisson counting process. We now provide a crash course in the theory 
of such processes on a measurable space (X,B). (The standard reference is |35j.) 
Although we will only need this theory for X = M + , there is no extra cost for 
introducing it in general. Let M.(X) denote the set of a-finite measures on X. 
Suppose that \i e M(X) and consider the special case n{X) < 00. Thus, 
/ fj.(X) is a probability measure and we can sample, independently, points 
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Yi,Y 2 ,... according to this distribution. Let No be Poisson(/z(Jf )) distributed, 
so that P(iV = n) = ^fj c -v( x ) . Conceptually, the Poisson point process with 
intensity measure /i is simply the random collection {Y\, . . . , Y/v }. 

Formally, the point process is defined in terms of a random counting measure 
N which counts the number of random points lying in sets A € B i.e. N(A) = 
Sfc°i ly 4 eA- Thus N(A) is a random variable, which has Poisson(/u(A)) distribu- 
tion. Indeed, 

oo N 

F(N(A) =k) = Y^ H N o = «)P(X] lyeA = k No = nj 

n—k i—1 

_ ^ r -»m ( n\ \ (v(A)\ k ( KA)Y l - k 

^ n! \k\{n-k)\) I l*(X)J 



= c 

/,■ ! 

n—k 



-H(A) 



k\ 

Similar calculations show that if A\, . . . , A^ G B are disjoint then N(A{), . . . , N(Ak) 
are independent. These properties turn out to be sufficient to completely specify 
the distribution of the random measure TV. 

Definition 7.1 (Poisson point process). A Poisson point process on X with inten- 
sity \i € M(X) (or PPP(n) for short) is a random counting measure N : B(X) — > 
N U {0} U {oo} such that 

• for any A G B{X), N(A) has Poisson(n(A)) distribution. By convention, 
N(A) = oo a.s. if n{A) = oo. 

• // A\, A2, . . . , Ak e B are disjoint, the random variables N(A\),..., 
N(Ak) are independent. 

For general a-finite intensity measures, we can construct N by superposition. 
Suppose that X — \J i X i where the Xi are disjoint and fi(Xi) < 00. Use the recipe 
given at the start of this section to construct, independently a PPP(^|x;) N on 
each subspace Xi. Then N(A) — J27Li N(A) is the desired measure. It is purely 
atomic, and the atoms Yi,Y 2 , . . . are called the points of the process. In applications 
it is useful to know moments and Laplace transforms of functionals of the process. 

Lemma 7.2. 

(1) First moment: If f > or f e L 1 ^) then 



(we agree that both sides can be oo). 
(2) Campbell's formula: If f > or 1 — e~f e L 1 ^) then 



E 



exp(- £/(*•))] =exp(- f (1-e-f^Udy)) 



(we agree that exp(— 00) = 0). 
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(3) Palm's formula: Let M(X) C Ai(X) denote the space of point measures 
on X ; let G : X x Ai — > K + be a measurable functional of the points; and 
suppose f is as in (2). Then 



¥^f{Yi)G{Y u N) 



E[G(y,6 y + N)]f(y)fi(dy). 



x 



The formulation here is that of Lemma 2.3 of [§]. We include sketch proofs to 
give a flavor of the calculations involved. 

PROOF. Let / = J2k=i °k ^A k , be a simple function with /i(Ak) < oo. 
(1) We have 



E 



: E/( y *)l =K[£c k N(A k )\ =J2c kf x(A k )= f f(y)fx(dy). 
(2) We have 



(7.7) 



k=l 



k=l 



E 



= E 



-T, k c k N(A k ) \ = 



n 

IIe[ ( 



-E fc c fc AT(A fc ) 



fe=l 



(7.8) 



= ncx P (-/i(A fe )(l-e- Cfc ))=cxp(- / (l-e-'M)/i(dy)). 
fc=i ^ x 

Both (1) and (2) extend to measurable / > using standard arguments, which 
we omit. Part (1) for / £ L 1 ^) follows immediately. Part (2) for 1 — er$ g L 1 (/i) 
is also omitted. 

(3) First suppose G is of the form G(N) = exp(— J2i dO^i)) f° r some non- 
negative measurable <?. Campbell's formula gives, for q > 0, 



E 



cxp( - / (1 - e-i fiy) - g(y) )Li{dy) 



~eTq>(-qJ2f(Yi))G(N) 

i 

Differentiating this identity in q at gives 

E g /(^)G(iV)] = | /(y)e-^M(d») exp(- 



(7.9) 



_ p-g(y) 



)/*(dtf) 



/(y)e-^V(dy) E [cxp(- £ 
/(y) E [exp(- ^ 5 (y,) - . 9 (y)) 1 M (dt/) 



(7.10) 



= / /(y)E[G(AT + <y] M (dy), 



A" 



where Campbell's formula is used to get the second and last lines. 

Now, suppose G(y, N) = J2k=i c k l y eA fc exp(- £\ 9k(Xi)) for A i, ■ ■ • , A n e B 
and measurable g k : X — > [0, oo). By linearity, the preceding calculations give 
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/n 
Y,c k l y&Ah f(y)E[exp{-J29k(Yi) - g k (y))]fi(dy) 
- c fc=l i 



X 



f(y)E[G(y,N + 6 y )]n(dy). 



From here it is a standard monotone class argument. 



(7.11) 
□ 



7.2.2. The Poisson-Dirichlet distribution via a PPP. Consider the PPP with 
intensity measure given by r/(dx) = Qx~ l exp(— x)dx on [0,oo). (Note that 77 is 
an infinite measure, but is cr-finite since 77(2~ fe_1 , 2~ fe ] < 6.) A practical way to 
construct this process is given in Tavare [47] . Let T\ < T 2 < . . . be the points of a 
Poisson counting process of rate (that is, the differences T i+ i — Ti are independent 
exponential variables of rate 0) and E±,E2, ... be exponentially distributed with 
rate 1. Then, the points in our PPP(?7) can be expressed as £j = exp(— Tj)Ei, i > 1. 




Figure 7. A sample of the Poisson Point Process (points marked 
by bars) with intensity measure 77 (overlaid in gray) . Note that the 
points are dense around the origin. 



The probability that all points are less than K > is 

P(N(K, 00) = 0) = cxp^- J 9x~ l exp(-:c)dx) 1 (7.12) 

as K — > 00. Thus, there is a largest point and we can order the points in decreasing 
order so that £1 > £ 2 > • ■ • > 0. The sum YliLi ^ ^ s nn ite almost surely. Indeed, 
we can say much more. Recall that the Gamma(7, A) distribution has density 

—. . A 7 ^ 7 " 1 exp(-Ax). 
1 (7) 

Lemma 7.3. We have 



£i r~j Gamma(6', 1). 



i=i 
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PROOF. Since i s a non-negative random variable, its distribution is de- 

termined by its Laplace transform. By Campbell's formula, this is given by 



E 



expf-r y]&)] = exp(-6> J (1 - e rx )x 1 cxp(-x)dx^ 
= exp^— 9 j J exp(— x(l + r))d.xdr^ 



(7.13) 



= ^ + r)- e , 

for |r| < 1, implying that i s Gamma(#, 1) distributed. □ 

The Poisson-Dirichlct(6>) distribution, PD# e Ai(Ai), is the law of the ordered 
points, normalised by their sum, i.e. 

In the next section, we will wish to appeal to various properties of Beta and 
Gamma random variables which are often known collectively as the "Beta-Gamma 
algebra". Recall that the Beta(a, b) distribution has density pj^^y£ a (l — t) b on 
[0,1]- 

Lemma 7.4. Suppose that ~ Gamma(a, A) and <~ Gamma(/3, A) are inde- 
pendent. Then 

• + - Gammafa + /3, A), 

• r^/(r^+r^)^Beta(a,/3) ; 

• The two random variables above are independent. 

Note that the converse also follows: if B ~ Bcta(a, /3) is independent of r^ +/3 <~ 
Gamma(d + /3, A) then BT^ +fj ~ Gamma(a, A), (1 — £>)r^ +(3 <~ Gamma(/3, A) and 
these last two random variables are independent. 

PROOF. In order to simplify the notation, let X = and Y = Tp. We will 
find the joint density of S = X + Y and R = Xj (X + Y). We first find the Jacobian 
corresponding to this change of variables: we have 

dx dx 
ds dr 

— = 1 - r — = -s 
ds ' dr 

and so the Jacobian is | — rs— (1 — r)s\ = s. Noting that X = RS and Y = (l — R)S, 
we see that S and R have joint density 

s r^y Aa ( rs ) a " le " Ars fp) ((1 " r ) s )^ lg ~ A(1 ~ r)s 
~r( a + /?) A r( a )r(/3) r u rj ' 

Since this factorizes with the factors being the correct Gamma and Beta densities, 
the result follows. □ 
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In the next lemma, we will see the power of the Beta-Gamma algebra. We use 
it to make a connection between our two different representations of the Poisson- 
Dirichlet distribution. This will serve as a warm up for the calculations in the next 
section. 

Lemma 7.5. Suppose that P = (Pi,P2, . . .) ~ PD#. Let P* be a size-biased pick 
from amongst Pi, P%, Then P, ~ Beta(l, 0). 

So P* has the same distribution as the length of the first stick in the stick- 
breaking construction. 

Proof. Note that, conditional on Pi, P2, . . ., we have that 

P* = Pi with probability Pj, i > 1. (7.16) 

In order to determine the distribution of P*, it suffices to find E[/(P*)] for all 
bounded measurable test functions / : [0, 1] — > M+. (Indeed, it would suffice to find 
E[/(P*)] for all functions of the form f(x) — exp(-qx) i.e. the Laplace transform. 
However, our slightly unusual formulation will generalize better when we consider 
random variables on Ai in the next section.) Conditioning on Pi, P2, . . . and using 
the Tower Law we see that 



E[/(P*)]=E[E[/(P*)|Pi,P 2 , 



E 



OO 



(7.17) 



Now use the representation (7.14) to see that this is equal to 

This is in a form to which we can apply the Palm formula; we obtain 



E 



y 



-f 



Oil 



-K-y 



(7.18) 



(7.19) 



After cancelling y and y 1 , we recognise the density of the Exp(l) (= Gamma(l,l)) 
distribution and so we can write 



E 



-/ 



(7.20) 



where T ~ Exp(l ) is independent of ^1,^2, Recall that X)i=i d ~ Gamma(6 l , 1) 



Then by Lemma 7.4 T + YliLi nas a Gamma(0 + 1, 1) distribution and is inde- 



pendent of r/(r + zZiLi which has a Beta(l, 9) distribution. Hence, we get 



E 



E[/(B)], 



where P ~ Beta(l,6*). We conclude by observing that 

r g 1 _ 



(7.21) 



(7.22) 
□ 



We close this section by noting an important property of the PPP we use to 
create the Poisson-Dirichlet vector. 
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Lemma 7.6. The random variable Y^Li £» * s independent of 

1 



(^1,^2,^3, ■• •) • 



This is another manifestation of the independence in the Beta-Gamma algebra; 
see 1351. 



7.3. Split-merge invariance of Poisson-Dirichlet. We use the method 
that we exploited in the proof of Lemma 7.5 to prove part (a) of Theorem |7.1| 

First define a random function F : Ax Ai corresponding to (/? S5 /3m) split- 
merge as follows. Fix p £ Ai and let I(p) and J(p) be the indices of the two 
independently size-biased parts of p, that is 

P(I{p) = k)= P(J{p) =k)=p k , k> 1. (7.23) 

Now let U and V be independent U(0, 1) random variables, independent of I(p) 
and J(p). Let 



' Sfp if I(p) = J{p) = i and V < (3 S 
F(» = <( M ijP if /(p) = i ^ J(p) = j and V < /3 n 
p otherwise. 



(7.24) 



We wish to prove that if P ~ PD e then F(P) ~ PD e also. Let g : Ai -> R+ be 
a bounded measurable test function which is symmetric in its arguments (this just 
means that we can forget about ordering the elements of our sequences). Then, 
conditioning on P, considering the different cases and using the Tower Law, we 
have 



E[g(F(P))} =e|e [i v <a J2 1 Kp)=J(p)=* 9(Sf r P) 

oo 



P 



+ E 
+ E 
+ E 



P 



(7.25) 



P 



P 



1 v<^ m X! i/(p)=i ij(p)=j g{ M i P) 

Note that, conditional on P, I(P) = i, J{P) = j with probability PtPj, so that we 
get 



)=j9(P) 



E[g(F(P))} =p s E[£Pi9(S¥P)] + (1 - ft) E Pf 5 (P) 

i=l " i=l 

+ p m E /•,/',.//; .\/,../':^ + (1 - /3 m ) E PP^P) 

Now use the symmetry of g to write 

g(S%P) = g {{P k U, P k (l - U), (P)*>m^)) 

and 

g{M l3 P) = g {(Pi + P j} (P fc )fc>i,fc^)) ■ 



(7.26) 



(7.27) 



(7.28) 
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Set (P 1 ,P 2 ,...) = s j g . as in < 7 - 14 » t0 obtain 

Efe(i ? (P))]=AE 



si 



+ (1-A)E 

+ /?mE 



s 2 



;ff( y^oo 7r (6)i>l) 



fc=l(Ei=l«) V 2^i=l^ 
XI „£^ c ^fff^oo 1 7; (£< + £#> (&)*>l,fc?«j)) 



(7.29) 



+ (1- An)E 



66 



Z^i=l si 



The Palm formula (Lemma 7.2 (3)) applied to each of the expectations above 
(twice for the double sums) gives 



E[g(F(P))} =0/3 s E 

+ 0(1-A)E 
+ # 2 /3 m E 



1 



o (*+Er=i£ fe r v *+n°=i6 
i 



(zt/,x(l-[/),(6)i>i))dz 



a; 1 e x a; 2 



2 9 ( x 



(», (6)i>i) dx 



1 



x e y e y xy 



+9 2 (l - /?m)E 



1 



2/ + E^Li t 



{% + V, (6)i>i) Jdxdy 
(x,y, (&)i>i) Jdxdy 



(7.30) 



It helps to recognise the densities we are integrating over here (after cancellation). 
In the first two expectations, which correspond to split proposals, we have the den- 
sity xe~ x of the Gamma(2,l) distribution. The other density to appear is e~ x e~ y , 
which corresponds to a pair of independent standard exponential variables. Using 
Lemma 17.41 it follows that 



E[g(F(P))} = ep a E 
+ 6(1 - /3 S ) E 

+ e 2 p m E 



(r + EZ 
i 



i ( i 

a 



:b? 9 (FTIkT& (^t/ • ^<1 - ^/, • (^ ' ) ' ^ ' , ) 

( i — T^-> (&)«>l) 

(r,te)i>i) 



+ 2 (l-/3m)E 



(r + Er=i^) 2 ' 



g L^oo 7 (rt/,r(i-^),te)i>i)) 



7.6 



(7.31) 

_ r + 

Efc i s Gamma(2 + 0, 1) distributed and independent of the argument of g in all 
of the above expectations. More calculation shows that 



where T ~ Gamma(2,l), independently of (£i)i>i- By Lemmas 7.4 and 



E 



L(r + Er=i&) 2 



i) 



(7.32) 
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and so we are left with 

9(9 + 1) 



E[g(F(P))} 



E 



r + Er=i^ 



0(1^ + 0^ 



r + Efe=i 6 



9(9 + 1) 
Next use /3 S = 0/3 m to get 

0/3 s + 2 (l-/3 m ) = 2 and 0(1 - ft) + 2 /3, 
So the expression for E[g(F(P))] simplifies to 

1 



(ro,r(i-E0,(&)i>i)) 
(r,(6)i>i)) 



(f + i) 



E 



.9 



r + Efeli Cfe 



i 



(<? + i) 



E 



(m,r(i-t^),te)i>i)) 
(r,&)i>i)) 



(7.33) 



(7.34) 



(7.35) 



We can re-express this as a sum of expectations as follows: 



0(0 + 1) 



E 



-E 



- x e- y g{ 



1 



x + y + E fe =i Cfe 
l 

Efe=i Cfc 



0(0 + 1) 

Using the Palm formula in the other direction gives 
1 



(x,y, {^i)i>i)jdxdy 
(x, (&) l > 1 )jdxdy 



(7.36) 



0(0+1) 
1 



E 



0(0 + 1) 



1 1 

E&& g ( v°° f (gfc)fc>l) +E^( v co f (6)fc>l) 

Z^fe=i sfe j. = i Z^fc=i sfe 

oo 2 ^ 



r °° 

E 



(7.37) 



Once again, Efcli Cfc is independent of the argument of g. Moreover, it is easily 
shown that 

OO „-i 



E 



fe=i 



0(0 + 1), 



(7.38) 



since it is simply the second moment of a Gamma(0, 1) random variable. Thus, 

E[g(F(P))}=E[g(P)}, (7.39) 

from which the result follows. 

7.4. Split-merge in continuous time. The dynamics in the next section 
will be in continuous time, so we close this section by describing a continuous time 
version of the split-merge process. First, consider the standard Poisson counting 
process (N t , t > 0), perhaps the simplest continuous time Markov chain. Its 
trajectories take values in {0, 1,2,.. .}, are piecewise constant, increasing and right 
continuous. At each integer k, it is held for an exponentially distributed random 
time before jumping to k + 1. Consequently, only finitely many jumps are made 
during each finite time interval. We say Nt increments at rate 1. 

Continuous time split-merge is the process (P Nt , t > 0) obtained by composing 
(P k , k = 0,1,2,3,...) with an independent Poisson counting process. It is a 
Markov process in Ai with the following dynamics. Suppose the present state is 
p e Ai. Attach to each part pi an exponential alarm clock of rate f3 s p 2 and to each 
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pair (j>i,Pj) of distinct parts a clock of rate 2j3 m piPj. Wait for the first clock to 
ring. If pi's clock rings first then split pi uniformly (i.e. apply Sf with U uniform). 
If the alarm for (pi,Pj) rings first then apply My. In other words, part pi splits 
uniformly at rate ft s pi and distinct parts pi and pj merge at rate 2ft m piPj. Due to 
the memoryless property of the exponential distribution, once an alarm clock has 
rung, all of the alarm clocks are effectively reset, and the process starts over from 
the new state. 

More formally, define the rate kernel Q : Ax x B(Ai) — > [0, oo) by 

q(p,-) ■= ^y^Pi / $s? P (-)du+ p m y2PiPjSM iiP (-) (7.40) 

and the (uniformly bounded) 'rate of leaving' q : A x — > [0, oo) 

q(p) := Q(p, Ai) = ft s + ftm^PiPj- (tai) 

i vjtj 

Using standard theory (e.g. Proposition 12.20, [31]), there exists a Markov process 
on Ai that waits for an Exponential((7(p)) amount of time in state p before jumping 
to a new state chosen according to Q(p, -)/q(p). Furthermore, since 

K/3 s ,/3 m (p, ') = Q(P, O + C 1 - q(p))$p(-), ( 7 - 42 ) 

this process is constructed explicitly as (P Nt , t > 0). The coincidence of the 
invariant measures in discrete and continuous time is immediate. 

Lemma 7.7. A measure v £ A^(Ai) is invariant for the continuous time process 
(P N * , t > 0) if, and only if, it is invariant for (P k , k = 0, 1, 2, 3, . . .). 



8. Effective split-merge process of cycles and loops 

This section contains an heuristic argument that connects the loop and cycle 
models of section |6.1| and the split-merge process in section |7.4| The heuristic 
leads to the conjecture that the asymptotic normalized lengths of the cycles and 
loops have Poisson-Dirichlet distribution. By looking at the rates of the effective 
split-merge process, we can identify the parameter of the distribution. 

Consider the cycle or loop model on the cubic lattice A„ = {1, . . . , n} d in Z d . 
As hinted at in section |6.1| we expect that macroscopic cycles emerge for inverse 
temperatures f3 large enough as n — > oo. Of course, we believe this also holds for any 
sequence of sufficiently connected graphs (A„) with diverging number of vertices, 
but for simplicity we restrict attention to cubic lattices. Furthermore, since the 
same arguments apply to both the cycle and loop models, we focus on cycles and 
only mention the modifications for loops when necessary. 

Denote by A^' the length of the i th longest cycle, and recall that n m&cvo {j3) is 



the fraction of sites lying in macroscopic cycles (see Section 4.4 1 



Conjecture 8.1. Suppose d > 3. There exists j3 c > such that for ft > j3 c : 

(a) The fractions of sites in infinite and macroscopic cycles (or loops) ap- 
proach the same typical value, and 

V := Voo{ft) = f?macro(/3) > 0. 
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(b) The vector of ordered normalised cycle lengths 



rj n a rjn a 

converges weakly to a random variable £ in Ai as n — > oo. 

Assuming the conjectured result is true, what is the distribution of £? In some 
related models (the random-cluster model) , £ has been found to be the trivial (and 
non-random!) partition (1,0,0, . . .). However, we conjecture that there are many 
macroscopic cycles in our model (rather than a unique giant cycle) and that their 
relative lengths can be described explicitly by the Poisson-Dirichlet distribution. 



Conjecture 8.2. The distribution of £ in Conjecture 8.1 (b) is PDg for an appro- 
priate choice of 9. 

The rest of this section is concerned with justifying this conjecture. The reader 
may guess what the parameter should be. We will tease it out below and identify 
it in section 18.41 

See Section |6.3| for a summary of rigorous results by Schramm to support this 
conjecture on the complete graph. 

8.1. Burning and building bridges. Recall that Va„,i3.^ denotes the prob- 
ability measure for either the loop or cycle model. We define an ergodic Markov 
process on with Pa,/?,^ as invariant measure. The process evolves by adding or 
removing bridges to the current configuration. Conveniently, the effect of such an 
operation is to either split a cycle or merge two cycles. 

Lemma 8.1. Suppose uj £ Q and uj' is uj with either a bridge added (i.e. uj' = 
uj U {(e, t)} for some (e,t) £ £ x [0,/3],) or a bridge removed (i.e. uj' = uj — {(e,i)} 
for some (e, f ) £ uj). Then C(uj') is obtained by splitting a cycle or merging two 
cycles in C(uj). Similarly, C(uj') is obtained by a split or merge in C(uj). 

The point is that adding or removing a bridge never causes, for example, several 
cycles to join, a cycle to split into many pieces or the cycle structure to remain 
unchanged. 




Figure 8. Adding or removing bridges always split or merge cy- 
cles. Up to topological equivalence, this figure lists all possibilities. 

The Lemma is most easily justified by drawing pictures for the different cases. 
Suppose that we add a new bridge. Either both endpoints of the new bridge belong 
to the same cycle or two different cycles. In the former case, the cycle is split and 
we say the bridge is a self-contact. In the latter case, the two cycles are joined and 
the bridge is called a contact between the two cycles. This is illustrated in Figure 
[8] for cycles and Figure [9] for loops. 
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Suppose that we remove an existing bridge. Again, either both of the bridge's 
endpoints belong to the same cycle (self-contact) or they are in different cycles 
(contact between the two cycles). In the former case, removal splits the cycle and 
in the latter, the two cycles are joined. 

As this argument hints, it is helpful to formally define the 'contacts' between 
cycles. Suppose that 7 G C(uj) is a cycle. Recall from Section pTT| that this means 
7(7") = (x(t), t(r)), t > is a closed trajectory in V x [0, (3] peT , where x is piecewise 
constant and has a jump discontinuity across the edge e = (x(t— ), x(r)) G £ at 
time t if, and only if, the bridge (e, t(r)) is present in w. Such bridges are called self 
contact bridges, the set of which is denoted *B 7 . Removing a bridge from 23 7 c uj 
causes 7 to split. 




Figure 9. Same as Figure |8j but for loops instead of cycles. 

The self contact zone £ 7 of 7 is the set of (e, r) G £ x [0, 0] for which e = 
(x(t),x(t + jj3)) for some integer j, i.e. the (e,t) bridge touches different legs of 
7's trajectory and so adding a bridge from £ 7 splits 7. 

The contact bridges 25 7i7 ' and zones £ 7l7 ' between distinct cycles 7,7' G C(u) 
are defined similarly. Specifically, 2$ 7)7 ' C w is comprised of bridges in w that 
are traversed by 7 = (x,t) and 7' = (x',t'), i.e. (e, t) G u> such that e = (ic(t + 
ji0),x'(t + j2/3)) for some integers ji,j2- Removal of a bridge in 58 7i7 ' causes 7 
and 7' to merge. 

£ 7i7 ' is the set of (e,i) G £ x [0,/3] such that e = + jif3), x(t + for 
some ji,j2, i-e. those bridges that would merge 7 and 7'. Note that the contact 
(and self contact) zones partition £ x [0,0] while the contact bridges partition ui. 

8.2. Dynamics. The promised PA./3,«-invariant Markov process, denoted 
(3Et)t>o is defined as follows. Suppose that a > 0. 

• A new bridge appears in (e, dt) at rate $"cft if its appearance causes a 
cycle to split and at rate if it causes two cycles to join. 

• An existing bridge is removed at rate i9 1-Q if its removal causes a cycle 
to split and at rate $ _ ( 1_Q; ) if its removal causes two cycles to join. 

• No other transitions occur. 

The rates are not uniformly bounded, so a little effort is required to check X 
is well behaved (does not 'explode'). Accepting this, we can show X is actually 
reversible with respect to our cycle model. 

Lemma 8.2. The unique invariant measure of X is Pa,/?,^. 

The proof is straightforward and so we omit it. 

In the sequel we take a — 1/2, so that adding and removing bridges occur at 
the same rates. 
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8.3. Heuristic for rates of splitting and merging of cycles. As we 

know, adding or removing bridges causes cycles to split or merge so the dynamics 
(C(X(), t > 0) that X induces on cycles is a kind of coagulation-fragmentation pro- 
cess. However, these dynamics are not Markovian and depend on the underlying 
process in a complicated manner. Ideally we would like a simpler, more transparent 
description for the dynamics. The first step towards this is to rewrite the transition 
rates for X in terms of the contact zones and bridges. 

Suppose that X is currently in state ui £ £1. A cycle 7 £ C(u>) splits if either a 
bridge from (£ 7 is added, or a bridge from 03 7 C u) is removed. The total rate at 
which these transitions occur is 

^(|Q3 7 | + |e: 7 |), (8.1) 

where |£ 7 | = J2 e <a£ Leb({t £ [0,(3] : (e,t) £ £ 7 }) is the (one-dimensional) Lebesgue 
measure of the self contact zone. Two distinct cycles 7 and 7' merge if a bridge 
from £ 7l7 ' is added or one from Q3 7 . 7 < removed. The combined rate is 

V^flaB^I + lty.yl), (8.2) 

where = £ ee£ Leb({t e [0,P] : (e,t) £ £ 7 , r }). 

8.3.1. Heuristics. We believe that, for suitably connected graphs and large 
enough /3, cycles should be macroscopic. The trajectories of these cycles should 
spread evenly over all edges and vertices in the graph. In particular, macroscopic 
cycles should come into contact with each other many times and we expect some 
averaging phenomenon to come into play. The longer a cycle is, on average, the 
more intersections with other cycles it should have. In particular, we believe the 
contact zone between two macroscopic cycles should have size proportional to the 
cycles' length. 

That is, if 7 and 7' are cycles with lengths A and A' respectively then there is 
a 'law of large numbers' 

\Cy\ ~ ic 2 A 2 , |Q3 7 | ~ ^ Cl A 2 (8.3) 

and 

|£ 7 ,y| ~ c 2 AA', |Q3 7 ,y| ~ ciAA', (8.4) 

for constants C\ and c 2 (the notation X ~ Y means that the ratio of the random 
variables converges to 1 in probability as A„ grows). 

The constants may depend on i? and j3 and the graph geometry. We believe 
they are linear in /3 but do not depend on Note that the size of the contact zones 
can be calculated easily for the complete graph. We get 

\&y tY \ = /3AA', |£ 7 | = f A(A - 1). (s.s) 

In the case $ = 1, we also have numerical support for 

|«8 7 , y | ~p\\', |Q5 T | - f A 2 . (8.6) 

8.4. Connection to uniform split-merge. Continuing with the heuristic, 
C(X) is 'n early ' a M arko v pro cess in which cycles split and merge. Substituting 



3) into (8.1) and (8.4 1 into (8.2 f, and multiplying by 2y/d(a + c 2 ) (which just 
changes the speed of the process, not its invariant measure) we see that a cycle of 
length A splits at rate "#A 2 , while two cycles with lengths A and A' merge at rate 
2AA'. There seems no reason to suppose that splits are not uniform. 
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Suddenly there are many similarities between C(X) and the continuous time 
split-merge process of section |7.4| This suggests that Poisson-Dirichlet PDg is 
lurking somewhere in the normalised cycle length distribution. What is the right 
choice of the parameter 91 

Write z9 = /3 s //3 m , Ps,Pm € (0,1] and multiply the rates by /3 m to see that 
a cycle of length A splits uniformly at rate /3 S A 2 , while two cycles with lengths A 
and A' merge at rate 2/3 m AA'. Up to the normalising factor (which is close to the 
constant 77macro|A„|), these are exactly the rates in section 7.4 Thus, the parameter 
9 should be equal to d. This fact was initially not obvious. 



References 

[1] M. Aizenman and B. Nachtergaele. Geometric aspects of quantum spin states. Comm. Math. 

Phys., 164(l):17-63, 1994. 
[2] D. Aldous. Deterministic and stochastic models for coalescence (aggregation and coagulation): 

a review of the mean-field theory for probabilists. Bernoulli, 5(l):3-48, 1999. 
[3] G. Alon and G. Kozma. The probability of long cycles in interchange processes. 

http://arxiv.org/abs/1009.3723, 2010. 
[4] O. Angel. Random infinite permutations and the cyclic time random walk. In Discrete random 

walks (Paris, 2003), Discrete Math. Theor. Comput. Sci. Proc, AC, pages 9-16 (electronic). 

Assoc. Discrete Math. Theor. Comput. Sci., Nancy, 2003. 
[5] R. Arratia, A. D. Barbour, and S. Tavare. Logarithmic combinatorial structures: a proba- 
bilistic approach. EMS Monographs in Mathematics. European Mathematical Society (EMS), 

Zurich, 2003. 

[6] N. Bcrcstycki. Emergence of giant cycles and slowdown transition in random transpositions 

and fc-cycles. Electr. J. Probab., 16:152-173, 2011. 
[7] V. L. Berezinskil. Destruction of long-range order in one-dimensional and two-dimensional 

systems having a continuous symmetry group i. classical systems. Soviet J. Exper. Theor. 

Phys., 32:493-500, 1971. 
[8] J. Bertoin. Random fragmentation and coagulation processes, volume 102 of Cambridge Stud- 
ies in Advanced Mathematics. Cambridge University Press, Cambridge, 2006. 
[9] V. Betz and D. Ueltschi. Spatial random permutations and Poisson-Dirichlet law of cycle 

lengths. Electr. J. Probab., 16:1173-1192, 2011. 
[10] M. Biskup. Reflection positivity and phase transitions in lattice spin models. In Methods of 

contemporary mathematical statistical physics, volume 1970 of Lecture Notes in Math., pages 

1-86. Springer, Berlin, 2009. 
[11] C. Borgs, R. Kotecky, and D. Ueltschi. Low temperature phase diagrams for quantum per- 
turbations of classical spin systems. Comm. Math. Phys., 181(2):409— 446, 1996. 
[12] L. Chayes, L. P. Pryadko, and K. Shtengel. Intersecting loop models on Z d : rigorous results. 

Nuclear Phys. B, 570(3) :590-614, 2000. 
[13] J. G. Conlon and J. P. Solovej. Upper bound on the free energy of the spin 1/2 Heisenberg 

ferromagnet. Lett. Math. Phys., 23(3):223-231, 1991. 
[14] N. Crawford and D. Ioffc. Random current representation for transverse field Ising model. 

Comm. Math. Phys., 296(2) :447-474, 2010. 
[15] N. Datta, R. Fernandez, and J. Frohlich. Low-temperature phase diagrams of quantum lattice 

systems. I. Stability for quantum perturbations of classical systems with finitely-many ground 

states. J. Statist. Phys., 84(3-4) :455-534, 1996. 
[16] P. Diaconis, E. Mayer-Wolf, O. Zeitouni, and M. P. W. Zerner. The Poisson-Dirichlet law 

is the unique invariant distribution for uniform split-merge transformations. Ann. Probab., 

32(lB):915-938, 2004. 

[17] F. J. Dyson, E. H. Lieb, and B. Simon. Phase transitions in quantum spin systems with 
isotropic and nonisotropic interactions. J. Statist. Phys., 18(4):335-383, 1978. 

[18] P. Erdos and A. Renyi. On the evolution of random graphs. Magyar Tud. Akad. Mat. Kutatd 
Int. Kozl., 5:17-61, 1960. 

[19] W. G. Faris. Outline of quantum mechanics. In Entropy and the quantum, volume 529 of 
Contemp. Math., pages 1-52. Amer. Math. Soc, Providence, RI, 2010. 



HEISENBERG MODELS AND THEIR PROBABILISTIC REPRESENTATIONS 



223 



[20] S. Feng. The Poisson-Dirichlet distribution and related topics. Probability and its Applica- 
tions (New York). Springer, Heidelberg, 2010. Models and asymptotic behaviors. 

[21] G. B. Folland. Real analysis. Pure and Applied Mathematics (New York). John Wiley & 
Sons Inc., New York, second edition, 1999. Modern techniques and their applications, A 
Wiley-Interscience Publication. 

[22] J. Frohlich, R. Israel, E. H. Lieb, and B. Simon. Phase transitions and reflection positivity. 

I. General theory and long range lattice models. Comm. Math. Phys., 62(l):l-34, 1978. 
[23] J. Frohlich, R. B. Israel, E. H. Lieb, and B. Simon. Phase transitions and reflection positivity. 

II. Lattice systems with short-range and Coulomb interactions. J. Statist. Phys., 22(3):297- 
347, 1980. 

[24] J. Frohlich, B. Simon, and T. Spencer. Infrared bounds, phase transitions and continuous 

symmetry breaking. Comm. Math. Phys., 50(l):79-95, 1976. 
[25] J. Frohlich and T. Spencer. The Kosterlitz-Thouless transition in two-dimensional abelian 

spin systems and the Coulomb gas. Comm. Math. Phys., 81(4):527-602, 1981. 
[26] D. Gandolfo, J. Ruiz, and D. Ueltschi. On a model of random cycles. J. Stat. Phys., 

129(4):663-676, 2007. 

[27] J. Ginibre. Existence of phase transitions for quantum lattice systems. Comm. Math. Phys., 
14(3):205-234, 1969. 

[28] G. R. Grimmett. Space-time percolation. In In and out of equilibrium. 2, volume 60 of Progr. 
Probab., pages 305-320. Birkhauser, Basel, 2008. 

[29] T. E. Harris. Nearest-neighbor Markov interaction processes on multidimensional lattices. 
Advances in Math., 9:66-89, 1972. 

[30] D. Ioffe. Stochastic geometry of classical and quantum Ising models. In Methods of contempo- 
rary mathematical statistical physics, volume 1970 of Lecture Notes in Math., pages 87-127. 
Springer, Berlin, 2009. 

[31] O. Kallenberg. Foundations of modern probability. Probability and its Applications (New 
York). Springer- Vcrlag, New York, second edition, 2002. 

[32] T. Kennedy, E. H. Lieb, and B. S. Shastry. Existence of Neel order in some spin-^ Heisenberg 
antifcrromagnets. J. Statist. Phys., 53(5-6):1019-1030, 1988. 

[33] J. F. C. Kingman. Random discrete distributions. J. Roy. Statist. Soc. Ser. B, 37:1—15, 1975. 
With a discussion by S. J. Taylor, A. G. Hawkcs, A. M. Walker, D. R. Cox, A. F. M. Smith, 
B. M. Hill, P. J. Burvillc, T. Leonard and a reply by the author. 

[34] J. F. C. Kingman. Mathematics of genetic diversity, volume 34 of CBMS-NSF Regional 
Conference Series in Applied Mathematics. Society for Industrial and Applied Mathematics 
(SIAM), Philadelphia, Pa., 1980. 

[35] J. F. C. Kingman. Poisson processes, volume 3 of Oxford Studies in Probability. The Claren- 
don Press Oxford University Press, New York, 1993. Oxford Science Publications. 

[36] J. M. Kosterlitz and D. J. Thouless. Ordering, metastability and phase transitions in two- 
dimensional systems. Journal of Physics C: Solid State Physics, 6(7):1181, 1973. 

[37] E. Mayer-Wolf, O. Zeitouni, and M. P. W. Zerner. Asymptotics of certain coagulation- 
fragmentation processes and invariant Poisson-Dirichlet measures. Electron. J. Probab., 7:no. 
8, 25 pp. (electronic), 2002. 

[38] N. D. Mermin and H. Wagner. Absence of ferromagnetism or antiferromagnctism in one- or 
two-dimensional isotropic heisenberg models. Phys. Rev. Lett., 17(22):1133-1136, Nov 1966. 

[39] B. Nachtergaele. Quantum spin systems after DLS 1978. In Spectral theory and mathematical 
physics: a Festschrift in honor of Barry Simon's 60th birthday, volume 76 of Proc. Sympos. 
Pure Math., pages 47-68. Amer. Math. Soc., Providence, RI, 2007. 

[40] E. J. Neves and J. F. Perez. Long range order in the ground state of two-dimensional anti- 
fcrromagnets. Physics Letters A, 114(6):331 - 333, 1986. 

[41] J. Pitman. Poisson-Dirichlet and GEM invariant distributions for split-and-mcrgc transfor- 
mation of an interval partition. Combin. Probab. Comput., 11(5):501— 514, 2002. 

[42] J. Pitman. Combinatorial stochastic processes, volume 1875 of Lecture Notes in Mathematics. 
Springer- Verlag, Berlin, 2006. Lectures from the 32nd Summer School on Probability Theory 
held in Saint-Flour, July 7-24, 2002, With a foreword by Jean Picard. 

[43] J. Pitman and M. Yor. The two-parameter Poisson-Dirichlet distribution derived from a 
stable subordinator. Ann. Probab., 25(2):855-900, 1997. 

[44] D. Ruelle. Statistical mechanics: Rigorous results. W. A. Benjamin, Inc., New York- 
Amsterdam, 1969. 



224 



C. GOLDSCHMIDT, D. UELTSCHI, AND P. WINDRIDGE 



[45] O. Schramm. Compositions of random transpositions. Israel J. Math., 147:221-243, 2005. 

[46] B. Simon. The statistical mechanics of lattice gases. Vol. I. Princeton Series in Physics. 
Princeton University Press, Princeton, NJ, 1993. 

[47] S. Tavare. The birth process with immigration, and the genealogical structure of large pop- 
ulations. J. Math. Biol., 25(2):161-168, 1987. 

[48] B. Toth. Improved lower bound on the thermodynamic pressure of the spin 1/2 Hcisenberg 
ferromagnet. Lett. Math. Phys., 28(l):75-84, 1993. 

[49] B. Toth. Reflection positivity, infrared bounds, continuous symmetry breaking, 
http: / /www. math. bme.hu/~balint/oktatas/statisztikus_fizika/jcgyzct/, 1996. 

[50] N. Tsilevich. On the simplest split-merge operator on the infinite-dimensional simplex. Arxiv 
preprint math/0106005, 2001. 

[51] N. V. Tsilevich. Stationary random partitions of a natural series. Teor. Veroyatnost. i Prime- 
nen., 44(l):55-73, 1999. 

Departments of Mathematics and Statistics, University of Warwick, Coventry, CV4 
7AL, United Kingdom 

E-mail address: c. a. goldschmidtSwarwlck.ac.uk 

E-mail address: daniel@ueltschi.org 

E-mail address: peter.windridge@warwick.ac.uk 



